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COMPOSITIONS AND METHODS FOR THE DIAGNOSIS AND TREATMENT OF TUMOR 

FIELD OF THE INVENTION 
The present invention is directed to compositions of matter useful for the diagnosis and treatment of 
tumor in mammals and to methods of using those compositions of matter for the same. 

5 

BACKGROUND OF THE INVENTION 
Malignant tumors (cancers) are the second leading cause of death in the United States, after heart disease 
(Boring et al., CA Cancel J. Clin. 43:7 (1993)). Cancer is characterized by the increase in the number of 
abnormal, or neoplastic, cells derived from a normal tissue which proliferate to form a tumor mass, the invasion 

10 of adjacent tissues by these neoplastic tumor cells, and the generation of malignant cells which eventually spread 
via the blood or lymphatic system to regional lymph nodes and to distant sites via a process called metastasis. 
In a cancerous state, a cell proliferates under conditions in which normal cells would not grow. Cancer 
r manifests itself in a wide variety of forms, characterized by different degrees of invasiveness and aggressiveness. 

In attempts to discover effective cellular targets for cancer therapy, researchers have sought to identify 

15 polypeptides that are specifically overexpressed on the surface of a particular type of cancer call as compared 
to on one or more normal non-cancerous cell(s) . The identification of such tumor-associated cell surface antigen 
polypeptides has given rise to the ability to specifically target cancer cells for destruction via antibody-based 
therapies. In this regard, it is noted that antibody-based therapy has proved very effective in the treatment of 
certain cancers. For example, HERCEPTIN® and RITUXAN® (both from Genentech Inc. , South San Francisco, 

20 California) are antibodies that have been used successfully to treat breast cancer and non-Hodgkin' s lymphoma, 
respectively. More specifically, HERCEPTIN® is a recombinant DNA-derived humanized monoclonal antibody 
that selectively binds to the extracellular domain of the human epidermal growth factor receptor 2 (HER2) 
proto-oncogene. HER2 protein overexpression is observed in 25-30% of primary breast cancers. RITUXAN® 
is a genetically engineered chimeric murine/human monoclonal antibody directed against the CD20 antigen found 

25 on the surface of normal and malignant B lymphocytes. Both these antibodies are recombinantly produced in 
CHO cells. 

Despite these advances in mammalian cancer therapy, however, there is a great need for additional 
diagnostic and therapeutic agents capable of detecting the presence of tumor in a mammal and for effectively 
inhibiting neoplastic cell growth, respectively. Accordingly, it is the objective of the present invention to identify 
30 cell surface polypeptides that are overexpressed on certain cancer cells as compared to on normal cells or other 
different cancer cells, and to use those polypeptides, and their encoding nucleic acids, to produce compositions 
of matter useful in the therapeutic treatment and diagnostic detection of cancer in mammals. 
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SUMMARY OF THE INVENTION 

A. Embodiments 

In the present specification, Applicants describe for the first time the identification of various cellular 
polypeptides (and their encoding nucleic acids or fragments thereof) which are expressed to a greater degree on 
the surface of one or more types of cancer cell as compared to on the surface of one or more types of normal 
5 non-cancer cells. Such polypeptides are herein referred to as Tumor-associated Antigenic Target polypeptides 
("TAT" polypeptides) and are expected to serve as effective targets for cancer therapy and diagnosis in 
mammals. 

Accordingly, in one embodiment of the present invention, the invention provides an isolated nucleic acid 
molecule comprising a nucleotide sequence that encodes a tumor-associated antigenic target polypeptide or 

10 fragment thereof (a "TAT" polypeptide). 

In certain aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80 % nucleic acid sequence identity, alternatively at least about 8 1 % , 82 % , 83 % , 84 % , 85 % , 86 % , 87 % , 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity, to 
(a) a DNA molecule encoding a full-length TAT polypeptide having an amino acid sequence as disclosed herein, 

15 a TAT polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain 
„. of a transmembrane TAT polypeptide, with or without the signal peptide, as disclosed herein or any other 
~Z' specifically defined fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein, or (b) 
the complement of the DNA molecule of (a). 

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 

20 80% nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 
89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity, to (a) a 
DNA molecule comprising the coding sequence of a full-length TAT polypeptide cDNA as disclosed herein, the 
coding sequence of a TAT polypeptide lacking the signal peptide as disclosed herein, the coding sequence of an 
extracellular domain of a transmembrane TAT polypeptide, with or without the signal peptide, as disclosed 

25 herein or the coding sequence of any other specifically defined fragment of the full-length TAT polypeptide 
amino acid sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In further aspects, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 
sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 81 %, 82% , 83% , 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid 

30 sequence identity, to (a) a DNA molecule that encodes the same mature polypeptide encoded by the full-length 
coding sequence of any of the human protein cDNAs deposited with the ATCC as disclosed herein, or (b) the 
complement of the DNA molecule of (a). In this regard, the term "full-length coding sequence" refers to the 
TAT polypeptide-encoding nucleotide sequence of the cDNA that is inserted into the vector deposited with the 
ATCC (which is often shown between start and stop codons, inclusive thereof, in the accompanying figures). 

35 Another aspect of the invention provides an isolated nucleic acid molecule comprising a nucleotide 

sequence encoding a TAT polypeptide which is either transmembrane domain-deleted or transmembrane domain- 
inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain(s) 



of such polypeptide(s) are disclosed herein. Therefore, soluble extracellular domains of the herein described 
TAT polypeptides are contemplated. 

In other aspects, the present invention is directed to isolated nucleic acid molecules which hybridize to 
(a) a nucleotide sequence encoding a TAT polypeptide having a full-length amino acid sequence as disclosed 
herein, a TAT polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular 
5 domain of a transmembrane TAT polypeptide, with or without the signal peptide, as disclosed herein or any 
other specifically defined fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein, 
or (b) the complement of the nucleotide sequence of (a) . In this regard, an embodiment of the present invention 
is directed to fragments of a full-length TAT polypeptide coding sequence, or the complement thereof, as 
disclosed herein, that may find use as, for example, hybridization probes useful as, for example, diagnostic 
10 probes, antisense oligonucleotide probes, or for encoding fragments of a full-length TAT polypeptide that may 
optionally encode a polypeptide comprising a binding site for an anti-TAT polypeptide antibody, a TAT binding 
oligopeptide or other small organic molecule that binds to a TAT polypeptide. Such nucleic acid fragments are 
£== usually at least about 5 nucleotides in length, alternatively at least about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
*0 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 
1$ 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 210, 
iO 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 
^ 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 
in 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 
850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, or 1000 nucleotides in length, 
S) wherein in this context the term "about" means the referenced nucleotide sequence length plus or minus 10% 
\ a 0 f mat referenced length. It is noted that novel fragments of a TAT polypeptide-encoding nucleotide sequence 
| C may be determined in a routine manner by aligning the TAT polypeptide-encoding nucleotide sequence with 
j other known nucleotide sequences using any of a number of well known sequence alignment programs and 

determining which TAT polypeptide-encoding nucleotide sequence fragment(s) are novel. All of such novel 
25 fragments of TAT polypeptide-encoding nucleotide sequences are contemplated herein. Also contemplated are 
the TAT polypeptide fragments encoded by these nucleotide molecule fragments, preferably those TAT 
polypeptide fragments that comprise a binding site for an anti-TAT antibody, a TAT binding oligopeptide or 
other small organic molecule that binds to a TAT polypeptide. 

In another embodiment, the invention provides isolated TAT polypeptide encoded by any of the isolated 
30 nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated TAT polypeptide, comprising an amino acid 
sequence having at least about 80 % amino acid sequence identity , alternatively at least about 8 1 % , 82 % , 83 % , 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% amino acid 
sequence identity, to a TAT polypeptide having a full-length amino acid sequence as disclosed herein, a TAT 
35 polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain of a 
transmembrane TAT polypeptide protein, with or without the signal peptide, as disclosed herein, an amino acid 
sequence encoded by any of the nucleic acid sequences disclosed herein or any other specifically defined 



fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein. 

In a further aspect, the invention concerns an isolated TAT polypeptide comprising an amino acid 
sequence having at least about 80 % amino acid sequence identity, alternatively at least about 8 1 % , 82 % , 83 % , 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% amino acid 
sequence identity, to an amino acid sequence encoded by any of the human protein cDNAs deposited with the 
5 ATCC as disclosed herein. 

In a specific aspect, the invention provides an isolated TAT polypeptide without the N-terminal signal 
sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino 
acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein 
those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding 
10 nucleic acid molecule under conditions suitable for expression of the TAT polypeptide and recovering the TAT 
polypeptide from the cell culture. 

Another aspect of the invention provides an isolated TAT polypeptide which is either transmembrane 
domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
1 described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
15 appropriate encoding nucleic acid molecule under conditions suitable for expression of the TAT polypeptide and 
~ recovering the TAT polypeptide from the cell culture. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
f encoding any of the herein described polypeptides. Host cell comprising any such vector are also provided. By 
way of example, the host cells may be CHO cells, E. coli, or yeast. A process for producing any of the herein 
20 described polypeptides is further provided and comprises culturing host cells under conditions suitable for 
expression of the desired polypeptide and recovering the desired polypeptide from the cell culture. 

In other embodiments, the invention provides isolated chimeric polypeptides comprising any of the 
~ herein described TAT polypeptides fused to a heterologous (non-TAT) polypeptide. Example of such chimeric 
molecules comprise any of the herein described TAT polypeptides fused to a heterologous polypeptide such as, 
25 for example, an epitope tag sequence or a Fc region of an immunoglobulin. 

In another embodiment, the invention provides an antibody which binds, preferably specifically, to any 
of the above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, antibody 
fragment, chimeric antibody, humanized antibody, or single-chain antibody. Antibodies of the present invention 
may optionally be conjugated to a growth inhibitory agent or cytotoxic agent such as a toxin, including, for 
30 example, a maytansinoid or calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. 
The antibodies of the present invention may optionally be produced in CHO cells or bacterial cells and preferably 
induce death of a cell to which they bind. For diagnostic purposes, the antibodies of the present invention may 
be detectably labeled, attached to a solid support, or the like. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
35 encoding any of the herein described antibodies. Host cell comprising any such vector are also provided. By 
way of example, the host cells may be CHO cells, E. coli, or yeast. A process for producing any of the herein 
described antibodies is further provided and comprises culturing host cells under conditions suitable for 



expression of the desired antibody and recovering the desired antibody from the cell culture. 

In another embodiment, the invention provides oligopeptides ("TAT binding oligopeptides") which 
bind, preferably specifically, to any of the above or below described TAT polypeptides. Optionally, the TAT 
binding oligopeptides of the present invention may be conjugated to a growth inhibitory agent or cytotoxic agent 
such as a toxin, including, for example, a maytansinoid or calicheamicin, an antibiotic, a radioactive isotope, 
5 a nucleolytic enzyme, or the like. The TAT binding oligopeptides of the present invention may optionally be 
produced in CHO cells or bacterial cells and preferably induce death of a cell to which they bind. For diagnostic 
purposes, the TAT binding oligopeptides of the present invention may be detectably labeled, attached to a solid 
support, or the like. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
10 encoding any of the herein described TAT binding oligopeptides . Host cell comprising any such vector are also 
provided. By way of example, the host cells may be CHO cells, E. coli, or yeast. A process for producing any 
of the herein described TAT binding oligopeptides is further provided and comprises culturing host cells under 
conditions suitable for expression of the desired oligopeptide and recovering the desired oligopeptide from the 
If cell culture. 

13 In another embodiment, the invention provides small organic molecules ("TAT binding organic 

molecules") which bind, preferably specifically, to any of the above or below described TAT polypeptides. 
Optionally, the TAT binding organic molecules of the present invention may be conjugated to a growth inhibitory 
agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or calicheamicin, an antibiotic, 
a radioactive isotope, a nucleolytic enzyme, or the like. The TAT binding organic molecules of the present 

20 invention preferably induce death of a cell to which they bind. For diagnostic purposes, the TAT binding 
organic molecules of the present invention may be detectably labeled, attached to a solid support, or the like. 
^ In a still further embodiment, the invention concerns a composition of matter comprising a TAT 

polypeptide as described herein, a chimeric TAT polypeptide as described herein, an anti-TAT antibody as 
described herein, a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as 

25 described herein, in combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier. 

In yet another embodiment, the invention concerns an article of manufacture comprising a container and 
a composition of matter contained within the container, wherein the composition of matter may comprise a TAT 
polypeptide as described herein, a chimeric TAT polypeptide as described herein, an anti-TAT antibody as 
described herein, a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as 

3 0 described herein. The article may further optionally comprise a label affixed to the container, or a package insert 
included with the container, that refers to the use of the composition of matter for the therapeutic treatment or 
diagnostic detection of a tumor. 

Another embodiment of the present invention is directed to the use of a TAT polypeptide as described 
herein, a chimeric TAT polypeptide as described herein, an anti-TAT polypeptide antibody as described herein, 

35 a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as described herein, for 
the preparation of a medicament useful in the treatment of a condition which is responsive to the TAT 
polypeptide, chimeric TAT polypeptide, anti-TAT polypeptide antibody, TAT binding oligopeptide, or TAT 



binding organic molecule. 

B. Additional Embodiments 

Another embodiment of the present invention is directed to a method for killing a cancer cell that 
expresses a TAT polypeptide, wherein the method comprises contacting the cancer cell with an antibody, an 
oligopeptide or a small organic molecule that binds to the TAT polypeptide, thereby resulting in the death of the 
5 cancer cell. Optionally, the antibody is a monoclonal antibody, antibody fragment, chimeric antibody, 
humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and TAT binding organic 
molecules employed in the methods of the present invention may optionally be conjugated to a growth inhibitory 
agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or calicheamicin, an antibiotic, 
a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and TAT binding oligopeptides employed 

10 in the methods of the present invention may optionally be produced in CHO cells or bacterial cells. 

Yet another embodiment of the present invention is directed to a method of therapeutically treating a 
TAT polypeptide-expressing tumor in a mammal, wherein the method comprises administering to the mammal 
a therapeutically effective amount of an antibody, an oligopeptide or a small organic molecule that binds to the 
~~ TAT polypeptide, thereby resulting in the effective therapeutic treatment of the tumor. Optionally, the antibody 

15 is a monoclonal antibody, antibody fragment, chimeric antibody, humanized antibody, or single-chain antibody. 
Antibodies, TAT binding oligopeptides and TAT binding organic molecules employed in the methods of the 
present invention may optionally be conjugated to a growth inhibitory agent or cytotoxic agent such as a toxin, 
including, for example, a maytansinoid or calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic 
enzyme, or the like. The antibodies and oligopeptides employed in the methods of the present invention may 

20 optionally be produced in CHO cells or bacterial cells. 

Yet another embodiment of the present invention is directed to a method of determining the presence 
', of a TAT polypeptide in a sample suspected of containing the TAT polypeptide, wherein the method comprises 
exposing the sample to an antibody, oligopeptide or small organic molecule that binds to the TAT polypeptide 
and determining binding of the antibody, oligopeptide or organic molecule to the TAT polypeptide in the sample, 

25 wherein the presence of such binding is indicative of the presence of the TAT polypeptide in the sample. 
Optionally, the sample may contain cells (which may be cancer cells) suspected of expressing the TAT 
polypeptide. The antibody, TAT binding oligopeptide or TAT binding organic molecule employed in the method 
may optionally be detectably labeled, attached to a solid support, or the like. 

A further embodiment of the present invention is directed to a method of diagnosing the presence of a 

30 tumor in a mammal, wherein the method comprises detecting the level of expression of a gene encoding a TAT 
polypeptide (a) in a test sample of tissue cells obtained from said mammal, and (b) in a control sample of known 
normal cells of the same tissue origin, wherein a higher level of expression of the TAT polypeptide in the test 
sample, as compared to the control sample, is indicative of the presence of tumor in the mammal from which 
the test sample was obtained. 

35 Another embodiment of the present invention is directed to a method of diagnosing the presence of a 

tumor in a mammal, wherein the method comprises (a) contacting a test sample of tissue cells obtained from the 
mammal with an antibody, oligopeptide or small organic molecule that binds to a TAT polypeptide and (b) 



detecting the formation of a complex between the antibody, oligopeptide or small organic molecule and the TAT 
polypeptide in the test sample, wherein the formation of a complex is indicative of the presence of a tumor in 
the mammal. Optionally, the antibody, TAT binding oligopeptide or TAT binding organic molecule employed 
is detectably labeled, attached to a solid support, or the like, and/or the test sample of tissue cells is obtained 
from an individual suspected of having a cancerous tumor. 
5 C. Further Additional Embodiments 

The following are further additional claim embodiments of the present invention. 

1 . Isolated nucleic acid having at least 80% nucleic acid sequence identity to: 

(a) a nucleotide sequence that encodes the amino acid sequence shown in Figure 5 (SEQ ID NO:5), 
Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 
10 (b) a nucleotide sequence that encodes the amino acid sequence shown in Figure 5 (SEQ ID NO:5), 

Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal 
peptide; 

=, (c) a nucleotide sequence that encodes the extracellular domain of the polypeptide shown in Figure 5 

(SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its 
associated signal peptide; 

(d) a nucleotide sequence that encodes the extracellular domain of the polypeptide shown in Figure 5 
(SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its 
associated signal peptide; 

(e) the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:2), Figure 3 
20 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

* (f) the full-length coding sequence of the nucleotide sequence shown in Figure 1 (SEQ ID NO : 1 ) , Figure 

I 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(g) the full-length coding sequence of the cDNA deposited under any ATCC accession number shown 
in Table 7; or 

25 (h) the complement of (a), (b), (c), (d), (e), (f), or (g). 

2. Isolated nucleic acid comprising: 

(a) a nucleotide sequence that encodes the amino acid sequence shown in Figure 5 (SEQ ID NO: 5), 
Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

(b) a nucleotide sequence that encodes the amino acid sequence shown in Figure 5 (SEQ ID NO:5), 
30 Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal 

peptide; 

(c) a nucleotide sequence that encodes the extracellular domain of the polypeptide shown in Figure 5 
(SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its 
associated signal peptide; 

35 (d) a nucleotide sequence that encodes the extracellular domain of the polypeptide shown in Figure 5 

(SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its 
associated signal peptide; 



(e) the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:2), Figure 3 
(SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) the full-length coding sequence of the nucleotide sequence shown in Figure 1 (SEQ ID NO: 1) , Figure 
2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(g) the full-length coding sequence of the cDNA deposited under any ATCC accession number shown 
5 in Table 7; or 

(h) the complement of (a), (b), (c), (d), (e), (f), or (g). 

3. Isolated nucleic acid that hybridizes to: 

(a) a nucleotide sequence that encodes the amino acid sequence shown in Figure 5 (SEQ ID NO:5), 
Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 
10 (b) a nucleotide sequence that encodes the amino acid sequence shown in Figure 5 (SEQ ID NO:5), 

Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal 
peptide; 

(c) a nucleotide sequence that encodes the extracellular domain of the polypeptide shown in Figure 5 
1 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its 

V5 associated signal peptide; 

(d) a nucleotide sequence that encodes the extracellular domain of the polypeptide shown in Figure 5 
I (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its 
=_ associated signal peptide; 

(e) the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:2), Figure 3 
2D (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

•=■ (f) the full-length coding sequence of the nucleotide sequence shown in Figure 1 (SEQ ID NO : 1 ) , Figure 

* 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(g) the full-length coding sequence of the cDNA deposited under any ATCC accession number shown 
in Table 7; or 

25 (h) the complement of (a), (b), (c), (d), (e), (f), or (g). 

4. The nucleic acid of Claim 3, wherein the hybridization occurs under stringent conditions. 

5. The nucleic acid of Claim 3 which is at least about 5 nucleotides in length. 

6. An expression vector comprising the nucleic acid of Claim 1. 

7. The expression vector of Claim 6, wherein said nucleic acid is operably linked to control 
30 sequences recognized by a host cell transformed with the vector. 

8. A host cell comprising the expression vector of Claim 7. 

9. The host cell of Claim 8 which is a CHO cell, an E. coli cell or a yeast cell. 

10. A process for producing a polypeptide comprising culturing the host cell of Claim 8 under 
conditions suitable for expression of said polypeptide and recovering said polypeptide from the cell culture. 

35 11. An isolated polypeptide having at least 80 % amino acid sequence identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO: 5), Figure 6 (SEQ ID NO: 6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

8 



(b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 

(c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 
signal peptide; 

5 (d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 

NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 
signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO: 1), 
Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 
10 (f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 

in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 
or 

:==, (g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 

;p any ATCC accession number shown in Table 7. 

12. An isolated polypeptide comprising: 

=|y (a) the amino acid sequence shown in Figure 5 (SEQ ID NO: 5), Figure 6 (SEQ ID NO: 6), Figure 7 

^| (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

,p (b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 

! _ (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 

20 (c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 

H NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 
j £ signal peptide; 

!=& (d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 

NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 
25 signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), 
Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 
in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

30 or 

(g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 
any ATCC accession number shown in Table 7. 

13. A chimeric polypeptide comprising the polypeptide of Claim 11 fused to a heterologous 
polypeptide. 

35 14. The chimeric polypeptide of Claim 13 , wherein said heterologous polypeptide is an epitope tag 

sequence or an Fc region of an immunoglobulin. 
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15. An isolated antibody which binds to a polypeptide having at least 80% amino acid sequence 
identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO: 5), Figure 6 (SEQ ID NO: 6), Figure 7 
(SEQ ID NO: 7), or Figure 8 (SEQ ID NO: 8); 

(b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 

(c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 
signal peptide; 

(d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 
signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), 
Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 
in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 
or 

(g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 
any ATCC accession number shown in Table 7. 

16. The antibody of Claim 15 which binds to a polypeptide comprising: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO: 5), Figure 6 (SEQ ID NO: 6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

(b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 

(c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 
signal peptide; 

(d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 
signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), 
Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 
in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 
or 

(g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 
any ATCC accession number shown in Table 7. 

17. The antibody of Claim 15 which is a monoclonal antibody. 
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18. The antibody of Claim 15 which is an antibody fragment. 

19. The antibody of Claim 15 which is a chimeric or a humanized antibody. 

20. The antibody of Claim 15 which is conjugated to a growth inhibitory agent. 

21. The antibody of Claim 15 which is conjugated to a cytotoxic agent. 

22. The antibody of Claim 21, wherein the cytotoxic agent is selected from the group consisting 
of toxins, antibiotics, radioactive isotopes and nucleolytic enzymes. 

23. The antibody of Claim 21, wherein the cytotoxic agent is a toxin. 

24. The antibody of Claim 23, wherein the toxin is selected from the group consisting of 
maytansinoid and calicheamicin. 

25. The antibody of Claim 23, wherein the toxin is a maytansinoid. 

26. The antibody of Claim 15 which is produced in bacteria. 

27. The antibody of Claim 15 which is produced in CHO cells. 

28. The antibody of Claim 15 which induces death of a cell to which it binds. 

29. The antibody of Claim 15 which is detectably labeled. 

30. An isolated nucleic acid comprising a nucleotide sequence that encodes the antibody of Claim 

15. 

31. An expression vector comprising the nucleic acid of Claim 30 operably linked to control 
sequences recognized by a host cell transformed with the vector. 

32. A host cell comprising the expression vector of Claim 3 1 . 

33. The host cell of Claim 32 which is a CHO cell, an E. coli cell or a yeast cell. 

34. A process for producing an antibody comprising culturing the host cell of Claim 32 under 
conditions suitable for expression of said antibody and recovering said antibody from the cell culture. 

35 . An isolated oligopeptide which binds to a polypeptide having at least 80 % amino acid sequence 
identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

(b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 

(c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 
signal peptide; 

(d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 
signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), 
Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 
in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 
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or 

(g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 
any ATCC accession number shown in Table 7. 

36. The oligopeptide of Claim 35 which binds to a polypeptide comprising: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

(b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO: 6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 

(c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 
signal peptide; 

(d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 
signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), 
Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 
in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 
or 

(g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 
any ATCC accession number shown in Table 7. 

37. The oligopeptide of Claim 35 which is conjugated to a growth inhibitory agent. 

38. The oligopeptide of Claim 35 which is conjugated to a cytotoxic agent. 

39. The oligopeptide of Claim 38, wherein the cytotoxic agent is selected from the group consisting 
of toxins, antibiotics, radioactive isotopes and nucleolytic enzymes. 

40. The oligopeptide of Claim 38, wherein the cytotoxic agent is a toxin. 

41. The oligopeptide of Claim 40, wherein the toxin is selected from the group consisting of 
maytansinoid and calicheamicin. 

42. The oligopeptide of Claim 40, wherein the toxin is a maytansinoid. 

43. The oligopeptide of Claim 35 which induces death of a cell to which it binds. 

44. The oligopeptide of Claim 35 which is detectably labeled. 

45 . A TAT binding organic molecule which binds to a polypeptide having at least 80 % amino acid 
sequence identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

(b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 
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(c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 
signal peptide; 

(d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 

5 signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), 
Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 
in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

10 or 

(g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 
any ATCC accession number shown in Table 7. 

46. The organic molecule of Claim 45 which binds to a polypeptide comprising: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
15 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); 

(b) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
2 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated signal peptide; 

(c) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), with its associated 

20 signal peptide; 

(d) an amino acid sequence of the extracellular domain of the polypeptide shown in Figure 5 (SEQ ID 
~ NO:5), Figure 6 (SEQ ID NO:6), Figure 7 (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8), lacking its associated 

signal peptide; 

(e) an amino acid sequence encoded by the nucleotide sequence shown in Figure 1 (SEQ ID NO:l), 
25 Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 

(f) an amino acid sequence encoded by the full-length coding sequence of the nucleotide sequence shown 
in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4); 
or 

(g) an amino acid sequence encoded by the full-length coding sequence of the cDNA deposited under 
30 any ATCC accession number shown in Table 7. 

47. The organic molecule of Claim 45 which is conjugated to a growth inhibitory agent. 

48. The organic molecule of Claim 45 which is conjugated to a cytotoxic agent. 

49. The organic molecule of Claim 48, wherein the cytotoxic agent is selected from the group 
consisting of toxins, antibiotics, radioactive isotopes and nucleolytic enzymes. 

35 50. The organic molecule of Claim 48, wherein the cytotoxic agent is a toxin. 

5 1 . The oragnic molecule of Claim 50, wherein the toxin is selected from the group consisting of 
maytansinoid and calicheamicin. 
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52. The organic molecule of Claim 50, wherein the toxin is a maytansinoid. 

53. The organic molecule of Claim 45 which induces death of a cell to which it binds. 

54. The organic molecule of Claim 45 which is detectably labeled. 

55. A composition of matter comprising: 
(a) the polypeptide of Claim 1 1 ; 

5 (b) the chimeric polypeptide of Claim 13; 

(c) the antibody of Claim 15, 

(d) the oligopeptide of Claim 35; or 

(e) the TAT binding organic molecule of Claim 45, in combination with a carrier. 

56. The composition of matter of Claim 55 , wherein said carrier is a pharmaceutically acceptable 

10 carrier. 

57. An article of manufacture: 

(a) a container; and 

(b) the composition of matter of Claim 55 contained within said container. 

58. The article of manufacture of Claim 57 further comprising a label affixed to said container, 
15 or a package insert included with said container, referring to the use of said composition of matter for the 

therapeutic treatment of or the diagnostic detection of a cancer. 

59. A method of killing a cancer cell that expresses a polypeptide having at least 80% amino acid 
sequence identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
W (SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); or 

(b) an amino acid sequence encoded by a nucleotide sequence comprising the nucleotide sequence shown 
in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4), 
said method comprising contacting said cancer cell with an antibody, oligopeptide or organic molecule that binds 
to said polypeptide on said cancer cell, thereby killing said cancer cell. 

25 60. The method of Claim 59, wherein said antibody is a monoclonal antibody. 

61. The method of Claim 59, wherein said antibody is an antibody fragment. 

62. The method of Claim 59, wherein said antibody is a chimeric or a humanized antibody. 

63. The method of Claim 59, wherein said antibody, oligopeptide or organic molecule is conjugated 
to a growth inhibitory agent. 

30 64. The method of Claim 59, wherein said antibody, oligopeptide or organic molecule is conjugated 

to a cytotoxic agent. 

65. The method of Claim 64, wherein said cytotoxic agent is selected from the group consisting 
of toxins, antibiotics, radioactive isotopes and nucleolytic enzymes. 

66. The method of Claim 64, wherein the cytotoxic agent is a toxin. 

35 67. The method of Claim 66, wherein the toxin is selected from the group consisting of 

maytansinoid and calicheamicin. 

68. The method of Claim 66, wherein the toxin is a maytansinoid. 
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69. The method of Claim 59, wherein said antibody is produced in bacteria. 

70. The method of Claim 59, wherein said antibody is produced in CHO cells. 

71 . The method of Claim 59, wherein said cancer cell is further exposed to radiation treatment or 
a chemotherapeutic agent. 

72. The method of Claim 59, wherein said cancer cell is selected from the group consisting of a 
breast cancer cell, a colorectal cancer cell, a lung cancer cell, an ovarian cancer cell, a central nervous system 
cancer cell, a liver cancer cell, a bladder cancer cell, a pancreatic cancer cell, a cervical cancer cell, a melanoma 
cell and a leukemia cell. 

73 . The method of Claim 5 9 , wherein said cancer cell overexpresses said polypeptide as compared 
to a normal cell of the same tissue origin. 

74. A method of therapeutically treating a mammal having a tumor comprising cells that express 
a polypeptide having at least 80% amino acid sequence identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); or 

(b) an amino acid sequence encoded by a nucleotide sequence comprising the nucleotide sequence shown 
in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4), 
said method comprising administering to said mammal a therapeutically effective amount of an antibody, 
oligopeptide or organic molecule that binds to said polypeptide, thereby effectively treating said mammal. 

75. The method of Claim 74, wherein said antibody is a monoclonal antibody. 

76. The method of Claim 74, wherein said antibody is an antibody fragment. 

77. The method of Claim 74, wherein said antibody is a chimeric or a humanized antibody. 

78. The method of Claim 74, wherein said antibody, oligopeptide or organic molecule is conjugated 
to a growth inhibitory agent. 

79 . The method of Claim 74 , wherein said antibody , oligopeptide or organic molecule is conjugated 
to a cytotoxic agent. 

80. The method of Claim 79, wherein said cytotoxic agent is selected from the group consisting 
of toxins, antibiotics, radioactive isotopes and nucleolytic enzymes. 

81 . The method of Claim 79, wherein the cytotoxic agent is a toxin. 

82. The method of Claim 81, wherein the toxin is selected from the group consisting of 
maytansinoid and calicheamicin. 

83. The method of Claim 81, wherein the toxin is a maytansinoid. 

84. The method of Claim 74, wherein said antibody is produced in bacteria. 

85. The method of Claim 74, wherein said antibody is produced in CHO cells. 

86. The method of Claim 74, wherein said tumor is further exposed to radiation treatment or a 
chemotherapeutic agent. 

87. The method of Claim 74, wherein said tumor is a breast tumor, a colorectal tumor, a lung 
tumor, an ovarian tumor, a central nervous system tumor, a liver tumor, a bladder tumor, a pancreatic tumor, 
or a cervical tumor. 
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88 . A method of determining the presence of a polypeptide in a sample suspected of containing said 
polypeptide, wherein said polypeptide has at least 80% amino acid sequence identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); or 

(b) an amino acid sequence encoded by a nucleotide sequence comprising the nucleotide sequence shown 
5 in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4), 

said method comprising exposing said sample to an antibody, oligopeptide or organic molecule that binds to said 
polypeptide and determining binding of said antibody, oligopeptide or organic molecule to said polypeptide in 
said sample. 

89. The method of Claim 88, wherein said sample comprises a cell suspected of expressing said 
10 polypeptide. 

90. The method of Claim 89, wherein said cell is a cancer cell. 

91 . The method of Claim 88, wherein said antibody, oligopeptide or organic molecule is detectably 

labeled . 

Z 92. A method of diagnosing the presence of a tumor in a mammal, said method comprising 

15 detecting the level of expression of a gene encoding a polypeptide having at least 80% amino acid sequence 
identity to: 

- (a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 

(SEQ ID NO:7), or Figure 8 (SEQ ID NO:8); or 
* (b) an amino acid sequence encoded by a nucleotide sequence comprising the nucleotide sequence shown 

20 in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4), 
in a test sample of tissue cells obtained from said mammal and in a control sample of known normal cells of the 
a same tissue origin, wherein a higher level of expression of said polypeptide in the test sample, as compared to 
„ the control sample, is indicative of the presence of tumor in the mammal from which the test sample was 
obtained. 

25 93 . The method of Claim 92, wherein the step detecting the level of expression of a gene encoding 

said polypeptide comprises employing an oligonucleotide in an in situ hybridization or RT-PCR analysis. 

94 . The method of Claim 92 , wherein the step detecting the level of expression of a gene encoding 
said polypeptide comprises employing an antibody in an immunohistochemistry analysis. 

95. A method of diagnosing the presence of a tumor in a mammal, said method comprising 
30 contacting a test sample of tissue cells obtained from said mammal with an antibody, oligopeptide or organic 

molecule that binds to a polypeptide having at least 80% amino acid sequence identity to: 

(a) the amino acid sequence shown in Figure 5 (SEQ ID NO:5), Figure 6 (SEQ ID NO:6), Figure 7 
(SEQ ID NO:7), or Figure 8 (SEQ ID NO: 8); or 

(b) an amino acid sequence encoded by a nucleotide sequence comprising the nucleotide sequence shown 
35 in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:2), Figure 3 (SEQ ID NO:3), or Figure 4 (SEQ ID NO:4), 

and detecting the formation of a complex between said antibody, oligopeptide or organic molecule and said 
polypeptide in the test sample, wherein the formation of a complex is indicative of the presence of a tumor in 



said mammal. 

96. The method of Claim 95 , wherein said antibody, oligopeptide or organic molecule is detectably 

labeled. 

97. The method of Claim 95 , wherein said test sample of tissue cells is obtained from an individual 
suspected of having a cancerous tumor. 

5 Further embodiments of the present invention will be evident to the skilled artisan upon a reading of 

the present specification. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a nucleotide sequence (SEQ ID NO:l) of a TAT154 cDNA, wherein SEQ ID NO:l is 
10 a clone designated herein as "DNA49143-1429" . 

Figure 2 shows a nucleotide sequence (SEQ ID NO:2) of a TAT155 cDNA, wherein SEQ ID NO:2 is 
a clone designated herein as "DNA73730-1679". 

Figure 3 shows a nucleotide sequence (SEQ ID NO:3) of a TAT156 cDNA, wherein SEQ ID NO:3 is 
2 a clone designated herein as "DNA77631-2537". 

15 Figure 4 shows a nucleotide sequence (SEQ ID NO:4) of a TAT136 cDNA, wherein SEQ ID NO:4 is 

a clone designated herein as "DNA59610-1556". 

Figure 5 shows the amino acid sequence (SEQ ID NO: 5) derived from the coding sequence of SEQ ID 
NO:l shown in Figure 1. 

Figure 6 shows the amino acid sequence (SEQ ID NO: 6) derived from the coding sequence of SEQ ID 
20 NO: 2 shown in Figure 2. 

Figure 7 shows the amino acid sequence (SEQ ID NO : 7) derived from the coding sequence of SEQ ID 
NO:3 shown in Figure 3. 

* Figure 8 shows the amino acid sequence (SEQ ID NO:8) derived from the coding sequence of SEQ ID 

NO: 4 shown in Figure 4. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I. Definitions 

The terms "TAT polypeptide" and "TAT" as used herein and when immediately followed by a 
numerical designation, refer to various polypeptides, wherein the complete designation (i.e. ,TAT/number) refers 
to specific polypeptide sequences as described herein. The terms "TAT/number polypeptide" and 
5 "TAT/number" wherein the term "number" is provided as an actual numerical designation as used herein 
encompass native sequence polypeptides, polypeptide variants and fragments of native sequence polypeptides 
and polypeptide variants (which are further defined herein). The TAT polypeptides described herein may be 
isolated from a variety of sources, such as from human tissue types or from another source, or prepared by 
recombinant or synthetic methods. The term "TAT polypeptide" refers to each individual TAT/number 
10 polypeptide disclosed herein. All disclosures in this specification which refer to the "TAT polypeptide" refer to 
each of the polypeptides individually as well as jointly. For example, descriptions of the preparation of, 
purification of, derivation of, formation of antibodies to or against, formation of TAT binding oligopeptides to 
or against, formation of TAT binding organic molecules to or against, administration of, compositions 
; 0 containing, treatment of a disease with, etc. , pertain to each polypeptide of the invention individually. The term 
jg "TAT polypeptide" also includes variants of the TAT/number polypeptides disclosed herein, 
iff A "native sequence TAT polypeptide" comprises a polypeptide having the same amino acid sequence 

L ^ as the corresponding TAT polypeptide derived from nature. Such native sequence TAT polypeptides can be 
lh isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence TAT 
polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific TAT 
20 polypeptide (e.g., an extracellular domain sequence), naturally -occurring variant forms (e.g., alternatively 
spliced forms) and naturally-occurring allelic variants of the polypeptide. In certain embodiments of the 
f invention, the native sequence TAT polypeptides disclosed herein are mature or full-length native sequence 
polypeptides comprising the full-length amino acids sequences shown in the accompanying figures. Start and 
stop codons (if indicated) are shown in bold font and underlined in the figures. Nucleic acid residues indicated 
25 as "N" in the accompanying figures are any nucleic acid residue. However, while the TAT polypeptides 
disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino 
acid position 1 in the figures, it is conceivable and possible that other memionine residues located either upstream 
or downstream from the amino acid position 1 in the figures may be employed as the starting amino acid residue 
for the TAT polypeptides. 

30 The TAT polypeptide "extracellular domain" or "ECD" refers to a form of the TAT polypeptide which 

is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a TAT polypeptide ECD will have 
less than 1 % of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of 
such domains. It will be understood that any transmembrane domains identified for the TAT polypeptides of 
the present invention are identified pursuant to criteria routinely employed in the art for identifying that type of 

35 hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more 
than about 5 amino acids at either end of the domain as initially identified herein. Optionally, therefore, an 
extracellular domain of a TAT polypeptide may contain from about 5 or fewer amino acids on either side of the 
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transmembrane domain/extracellular domain boundary as identified in the Examples or specification and such 
polypeptides, with or without the associated signal peptide, and nucleic acid encoding them, are contemplated 
by the present invention. 

The approximate location of the "signal peptides" of the various TAT polypeptides disclosed herein may 
be shown in the present specification and/or the accompanying figures. It is noted, however, that the C-terminal 
boundary of a signal peptide may vary, hut most likely by no more than about 5 amino acids on either side of 
the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary of the 
signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type of 
amino acid sequence element (e.g. , Nielsen et al. , Prot. Eng. 10: 1-6 (1997) and von Heinje et al. , Nucl. Acids. 
Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal sequence 
from a secreted polypeptide is not entirely uniform, resulting in more than one secreted species. These mature 
polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on either side of the 
C-terminal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, are 
contemplated by the present invention. 

"TAT polypeptide variant" means a TAT polypeptide, preferably an active TAT polypeptide, as defined 
herein having at least about 80% amino acid sequence identity with a full-length native sequence TAT 
polypeptide sequence as disclosed herein, a TAT polypeptide sequence lacking the signal peptide as disclosed 
herein, an extracellular domain of a TAT polypeptide, with or without the signal peptide, as disclosed herein 
or any other fragment of a full-length TAT polypeptide sequence as disclosed herein (such as those encoded by 
a nucleic acid that represents only a portion of the complete coding sequence for a full-length TAT polypeptide) . 
Such TAT polypeptide variants include, for instance, TAT polypeptides wherein one or more amino acid 
residues are added, or deleted, at the N- or C-terminus of the full-length native amino acid sequence. 
Ordinarily, a TAT polypeptide variant will have at least about 80% amino acid sequence identity, alternatively 
at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98%, or 99% amino acid sequence identity, to a full-length native sequence TAT polypeptide sequence 
as disclosed herein, a TAT polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular 
domain of a TAT polypeptide, with or without the signal peptide, as disclosed herein or any other specifically 
defined fragment of a full-length TAT polypeptide sequence as disclosed herein. Ordinarily, TAT variant 
polypeptides are at least about 10 amino acids in length, alternatively at least about 20, 30, 40, 50, 60, 70, 80, 
90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 
310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 
520, 530, 540, 550, 560, 570, 580, 590, 600 amino acids in length, or more. 

"Percent (%) amino acid sequence identity" with respect to the TAT polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the 
amino acid residues in the specific TAT polypeptide sequence, after aligning the sequences and introducing gaps, 
if necessary, to achieve the maximum percent sequence identity, and not considering any conservative 
substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid 
sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly 
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available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those 
skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed 
to achieve maximal alignment over the full length of the sequences being compared. For purposes herein, 
however, % amino acid sequence identity values are generated using the sequence comparison computer program 
ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in Table 1 below. The 
5 ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and the source code shown 
in Table 1 below has been filed with user documentation in the U. S. Copyright Office, Washington D.C. , 20559, 
where it is registered under U.S. Copyright Registration No. TXU510087. The ALIGN-2 program is publicly 
available through Genentech, Inc., South San Francisco, California or may be compiled from the source code 
provided in Table 1 below. The ALIGN-2 program should be compiled for use on a UNIX operating system, 
10 preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and 
do not vary. 

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid 
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which 
1 can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid 
15 sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
20 ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in 
B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 

■T sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of 

~ B to A. As examples of % amino acid sequence identity calculations using this method, Tables 2 and 3 
demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated 

25 "Comparison Protein" to the amino acid sequence designated "TAT" , wherein "TAT" represents the amino acid 
sequence of a hypothetical TAT polypeptide of interest, "Comparison Protein" represents the amino acid 
sequence of a polypeptide against which the "TAT" polypeptide of interest is being compared, and "X, " Y" and 
"Z" each represent different hypothetical amino acid residues. Unless specifically stated otherwise, all % amino 
acid sequence identity values used herein are obtained as described in the immediately preceding paragraph using 

30 the ALIGN-2 computer program. 

"TAT variant polynucleotide" or "TAT variant nucleic acid sequence" means a nucleic acid molecule 
which encodes a TAT polypeptide, preferably an active TAT polypeptide, as defined herein and which has at 
least about 80% nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native 
sequence TAT polypeptide sequence as disclosed herein, a full-length native sequence TAT polypeptide sequence 

35 lacking the signal peptide as disclosed herein, an extracellular domain of a TAT polypeptide, with or without 
the signal peptide, as disclosed herein or any other fragment of a full-length TAT polypeptide sequence as 
disclosed herein (such as those encoded by a nucleic acid that represents only a portion of the complete coding 
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sequence for a full-length TAT polypeptide). Ordinarily, a TAT variant polynucleotide will have at least about 
80 % nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 
89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity with a 
nucleic acid sequence encoding a full-length native sequence TAT polypeptide sequence as disclosed herein, a 
full-length native sequence TAT polypeptide sequence lacking the signal peptide as disclosed herein, an 
5 extracellular domain of a TAT polypeptide, with or without the signal sequence, as disclosed herein or any other 
fragment of a full-length TAT polypeptide sequence as disclosed herein. Variants do not encompass the native 
nucleotide sequence. 

Ordinarily, TAT variant polynucleotides are at least about 5 nucleotides in length, alternatively at least 
about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 
10 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 
170, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 
350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 
560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 
~ 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 

_15 980, 990, or 1000 nucleotides in length, wherein in this context the term "about" means the referenced 
nucleotide sequence length plus or minus 10% of that referenced length. 

"Percent (%) nucleic acid sequence identity" with respect to TAT-encoding nucleic acid sequences 
identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in the TAT nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
20 necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent 
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, 
Z using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) 

software. For purposes herein, however, % nucleic acid sequence identity values are generated using the 
sequence comparison computer program ALIGN -2, wherein the complete source code for the ALIGN-2 program 
25 is provided in Table 1 below. The ALIGN-2 sequence comparison computer program was authored by 
Genentech, Inc. and the source code shown in Table 1 below has been filed with user documentation in the U.S. 
Copyright Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. 
TXU510087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, 
California or may be compiled from the source code provided in Table 1 below. The ALIGN-2 program should 
30 be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison 
parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid 
35 sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 



100 times the fraction W/Z 
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where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 
in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be 
appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence 
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to 
C. As examples of % nucleic acid sequence identity calculations, Tables 4 and 5, demonstrate how to calculate 
the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic 
acid sequence designated "TAT-DNA", wherein "TAT-DNA" represents a hypothetical TAT-encoding nucleic 
acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule 
against which the "TAT-DNA" nucleic acid molecule of interest is being compared, and "N", "L" and "V" each 
represent different hypothetical nucleotides. Unless specifically stated otherwise, all % nucleic acid sequence 
identity values used herein are obtained as described in the immediately preceding paragraph using the ALIGN-2 
computer program. 

In other embodiments, TAT variant polynucleotides are nucleic acid molecules that encode a TAT 
polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash conditions, 
to nucleotide sequences encoding a full-length TAT polypeptide as disclosed herein. TAT variant polypeptides 
may be those that are encoded by a TAT variant polynucleotide. 

"Isolated," when used to describe the various TAT polypeptides disclosed herein, means polypeptide 
that has been identified and separated and/or recovered from a component of its natural environment. 
Contaminant components of its natural environment are materials that would typically interfere with diagnostic 
or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non- 
proteinaceous solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to 
obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, 
or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, 
preferably, silver stain. Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least 
one component of the TAT polypeptide natural environment will not be present. Ordinarily, however, isolated 
polypeptide will be prepared by at least one purification step. 

An "isolated" TAT polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is a 
nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with 
which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An isolated 
polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature. 
Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific polypeptide- 
encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding nucleic 
acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily express the 
polypeptide where, for example, the nucleic acid molecule is in a chromosomal location different from that of 
natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, 
for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells 
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are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or 
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide 
adaptors or linkers are used in accordance with conventional practice. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. 
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below their melting temperature. The higher the degree 
of desired homology between the probe and hybridizable sequence, the higher the relative temperature which 
can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions 
more stringent, while lower temperatures less so. For additional details and explanation of stringency of 
hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology . Wiley Interscience 
Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those 
that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a 
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum 
albumin/0. 1 % Ficoll/0. 1 % polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM sodium 
chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M 
sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, 
sonicated salmon sperm DNA (50 fxgluA), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42°C 
in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55 °C, followed by a high-stringency wash 
consisting of 0.1 x SSC containing EDTA at 55 °C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al., Molecular 
Cloning; A Laborator y Manual . New York: Cold Spring Harbor Press, 1989, and include the use of washing 
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately stringent conditions is overnight incubation at 37 °C in a solution 
comprising: 20 % formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 
7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50 °C. The skilled artisan will recognize how to adjust 
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 
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The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a TAT 
polypeptide or anti-TAT antibody fused to a "tag polypeptide". The tag polypeptide has enough residues to 
provide an epitope against which an antibody can be made, yet is short enough such that it does not interfere with 
activity of the polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that the 
antibody does not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least 
5 six amino acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 
and 20 amino acid residues). 

"Active" or "activity" for the purposes herein refers to form(s) of a TAT polypeptide which retain a 
biological and/or an immunological activity of native or naturally -occurring TAT, wherein "biological" activity 
refers to a biological function (either inhibitory or stimulatory) caused by a native or naturally-occurring TAT 
10 other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native 
or naturally-occurring TAT and an "immunological" activity refers to the ability to induce the production of an 
antibody against an antigenic epitope possessed by a native or naturally-occurring TAT. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 
blocks, inhibits, or neutralizes a biological activity of a native TAT polypeptide disclosed herein. In a similar 

4.5 manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological 
activity of a native TAT polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically 
include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of 
native TAT polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for 
identifying agonists or antagonists of a TAT polypeptide may comprise contacting a TAT polypeptide with a 

-20 candidate agonist or antagonist molecule and measuring a detectable change in one or more biological activities 
normally associated with the TAT polypeptide. 

"Treating" or "treatment" or "alleviation" refers to both therapeutic treatment and prophylactic or 
preventative measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic condition 
or disorder. Those in need of treatment include those already with the disorder as well as those prone to have 

25 the disorder or those in whom the disorder is to be prevented. A subject or mammal is successfully "treated" 
for a TAT polypep tide-expressing cancer if, after receiving a therapeutic amount of an anti-TAT antibody, TAT 
binding oligopeptide or TAT binding organic molecule according to the methods of the present invention, the 
patient shows observable and/or measurable reduction in or absence of one or more of the following: reduction 
in the number of cancer cells or absence of the cancer cells; reduction in the tumor size; inhibition (i.e., slow 

30 to some extent and preferably stop) of cancer cell infiltration into peripheral organs including the spread of 
cancer into soft tissue and bone; inhibition (i.e., slow to some extent and preferably stop) of tumor metastasis; 
inhibition, to some extent, of tumor growth; and/or relief to some extent, one or more of the symptoms 
associated with the specific cancer; reduced morbidity and mortality, and improvement in quality of life issues. 
To the extent the anti-TAT antibody or TAT binding oligopeptide may prevent growth and/or kill existing cancer 

35 cells, it may be cytostatic and/or cytotoxic. Reduction of these signs or symptoms may also be felt by the 
patient. 



24 



The above parameters for assessing successful treatment and improvement in the disease are readily 
measurable by routine procedures familiar to a physician. For cancer therapy, efficacy can be measured, for 
example, by assessing the time to disease progression (TTP) and/or determining the response rate (RR). 
Metastasis can be determined by staging tests and by bone scan and tests for calcium level and other enzymes 
to determine spread to the bone. CT scans can also be done to look for spread to the pelvis and lymph nodes 
5 in the area. Chest X-rays and measurement of liver enzyme levels by known methods are used to look for 
metastasis to the lungs and liver, respectively. Other routine methods for monitoring the disease include 
transrectal ultrasonography (TRUS) and transrectal needle biopsy (TRNB). 

For bladder cancer, which is a more localized cancer, methods to determine progress of disease include 
urinary cytologic evaluation by cystoscopy, monitoring for presence of blood in the urine, visualization of the 
10 urothelial tract by sonography or an intravenous pyelogram, computed tomography (CT) and magnetic resonance 
imaging (MRI). The presence of distant metastases can be assessed by CT of the abdomen, chest x-rays, or 
radionuclide imaging of the skeleton. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to 
^ an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 

15 "Intermittent" administration is treatment that is not consecutively done without interruption, but rather is cyclic 
= in nature. 

"Mammal" for purposes of the treatment of, alleviating the symptoms of or diagnosis of a cancer refers 
to any animal classified as a mammal, including humans, domestic and farm animals, and zoo, sports, or pet 
animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, rabbits, etc. Preferably, the mammal is human. 

20 Administration "in combination with" one or more further therapeutic agents includes simultaneous 

(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which 

_ are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often 

the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically 

25 acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including 
glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt- 

30 forming counterions such as sodium; and/or nonionic surfactants such as TWEEN®, polyethylene glycol (PEG), 
and PLURONICS®. 

By "solid phase" or "solid support" is meant a non-aqueous matrix to which an antibody, TAT binding 
oligopeptide or TAT binding organic molecule of the present invention can adhere or attach. Examples of solid 
phases encompassed herein include those formed partially or entirely of glass (e.g., controlled pore glass), 
35 polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In certain 
embodiments, depending on the context, the solid phase can comprise the well of an assay plate; in others it is 
a purification column (e.g. , an affinity chromatography column). This term also includes a discontinuous solid 



phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
which is useful for delivery of a drug (such as a TAT polypeptide, an antibody thereto or a TAT binding 
oligopeptide) to a mammal. The components of the liposome are commonly arranged in a bilayer formation, 
similar to the lipid arrangement of biological membranes. 
5 A "small" molecule or "small" organic molecule is defined herein to have a molecular weight below 

about 500 Daltons. 

An "effective amount" of a polypeptide, antibody, TAT binding oligopeptide, TAT binding organic 
molecule or an agonist or antagonist thereof as disclosed herein is an amount sufficient to carry out a specifically 
stated purpose. An "effective amount" may be determined empirically and in a routine manner, in relation to 
10 the stated purpose. 

The term "therapeutically effective amount" refers to an amount of an antibody, polypeptide, TAT 
binding oligopeptide, TAT binding organic molecule or other drug effective to "treat" a disease or disorder in 
a subject or mammal. In the case of cancer, the therapeutically effective amount of the drug may reduce the 
;p number of cancer cells; reduce the tumor size; inhibit (i.e., slow to some extent and preferably stop) cancer cell 
;.;f 5 infiltration into peripheral organs; inhibit (i.e., slow to some extent and preferably stop) tumor metastasis; 
;Q inhibit, to some extent, tumor growth; and/or relieve to some extent one or more of the symptoms associated 
^ with the cancer. See the definition herein of "treating" . To the extent the drug may prevent growth and/or kill 
jp existing cancer cells, it may be cytostatic and/or cytotoxic. 

; A "growth inhibitory amount" of an anti-TAT antibody, TAT polypeptide, TAT binding oligopeptide 

1^0 or TAT binding organic molecule is an amount capable of inhibiting the growth of a cell, especially tumor, e.g., 
I cancer cell, either in vitro or in vivo. A "growth inhibitory amount" of an anti-TAT antibody, TAT polypeptide, 

;== TAT binding oligopeptide or TAT binding organic molecule for purposes of inhibiting neoplastic cell growth 

i = i may be determined empirically and in a routine manner. 

A "cytotoxic amount" of an anti-TAT antibody, TAT polypeptide, TAT binding oligopeptide or TAT 
25 binding organic molecule is an amount capable of causing the destruction of a cell, especially tumor, e.g., cancer 
cell, either in vitro or in vivo. A "cytotoxic amount" of an anti-TAT antibody, TAT polypeptide, TAT binding 
oligopeptide or TAT binding organic molecule for purposes of inhibiting neoplastic cell growth may be 
determined empirically and in a routine manner. 

The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-TAT 
30 monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-TAT antibody 
compositions with polyepitopic specificity, polyclonal antibodies, single chain anti-TAT antibodies, and 
fragments of anti-TAT antibodies (see below) as long as they exhibit the desired biological or immunological 
activity. The term "immunoglobulin" (Ig) is used interchangeable with antibody herein. 

An "isolated antibody" is one which has been identified and separated and/or recovered from a 
35 component of its natural environment. Contaminant components of its natural environment are materials which 
would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and 
other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) 



to greater than 95 % by weight of antibody as determined by the Lowry method, and most preferably more than 
99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid 
sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or 
nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody 
in situ within recombinant cells since at least one component of the antibody's natural environment will not be 
5 present. Ordinarily, however, isolated antibody will be prepared by at least one purification step. 

The basic 4-chain antibody unit is a heterotetrameric glycoprotein composed of two identical light (L) 
chains and two identical heavy (H) chains (an IgM antibody consists of 5 of the basic heterotetramer unit along 
with an additional polypeptide called J chain, and therefore contain 10 antigen binding sites, while secreted IgA 
antibodies can polymerize to form polyvalent assemblages comprising 2-5 of the basic 4-chain units along with 

10 J chain). In the case of IgGs, the 4-chain unit is generally about 150,000 daltons. Each L chain is linked to a 
H chain by one covalent disulfide bond, while the two H chains are linked to each other by one or more disulfide 
bonds depending on the H chain isotype. Each H and L chain also has regularly spaced intrachain disulfide 
bridges. Each H chain has at the N-terminus, a variable domain (V H ) followed by three constant domains (C H ) 
for each of the a and y chains and four C H domains for n and e isotypes. Each L chain has at the N-terminus, 

15 a variable domain (V L ) followed by a constant domain (C L ) at its other end. The V L is aligned with the V H and 
the C L is aligned with the first constant domain of the heavy chain (C H 1). Particular amino acid residues are 

I believed to form an interface between the light chain and heavy chain variable domains. The pairing of a V H 

~ and V L together forms a single antigen-binding site. For the structure and properties of the different classes of 

antibodies, see, e.g., Basic and Clinical Immunology . 8th edition, Daniel P. Stites, Abba I. Terr and Tristram 

HO G. Parslow (eds.), Appleton & Lange, Norwalk, CT, 1994, page 71 and Chapter 6. 

The L chain from any vertebrate species can be assigned to one of two clearly distinct types, called 

~ kappa and lambda, based on the amino acid sequences of their constant domains. Depending on the amino acid 

sequence of the constant domain of their heavy chains (C H ), immunoglobulins can be assigned to different classes 
or isotypes. There are five classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, having heavy chains 

25 designated ot, 8, e, y, and p., respectively. The y and a classes are further divided into subclasses on the basis 
of relatively minor differences in C H sequence and function, e.g., humans express the following subclasses: 
IgGl, IgG2, IgG3, IgG4, IgAl, and IgA2. 

The term "variable" refers to the fact that certain segments of the variable domains differ extensively 
in sequence among antibodies. The V domain mediates antigen binding and define specificity of a particular 

30 antibody for its particular antigen. However, the variability is not evenly distributed across the 1 10-amino acid 
span of the variable domains. Instead, the V regions consist of relatively invariant stretches called framework 
regions (FRs) of 15-30 amino acids separated by shorter regions of extreme variability called "hypervariable 
regions" that are each 9-12 amino acids long. The variable domains of native heavy and light chains each 
comprise four FRs, largely adopting a P-sheet configuration, connected by three hypervariable regions, which 

35 form loops connecting, and in some cases forming part of, the (3-sheet structure. The hypervariable regions in 
each chain are held together in close proximity by the FRs and, with the hypervariable regions from the other 
chain, contribute to the formation of the antigen-binding site of antibodies (see Kabat et al., Sequences of 



Proteins of Immunological Interest , 5th Ed. Public Health Service, National Institutes of Health, Bethesda, MD. 
(1991)). The constant domains are not involved directly in binding an antibody to an antigen, but exhibit various 
effector functions, such as participation of the antibody in antibody dependent cellular cytotoxicity (ADCC). 

The term "hypervariable region" when used herein refers to the amino acid residues of an antibody 
which are responsible for antigen-binding. The hypervariable region generally comprises amino acid residues 
5 from a "complementarity determining region" or "CDR" (e.g. around about residues 24-34 (LI), 50-56 (L2) 
and 89-97 (L3) in the V L , and around about 1-35 (HI), 50-65 (H2) and 95-102 (H3) in the V H ; Kabat et al., 
Sequences of Proteins of Immunological Interest , 5th Ed. Public Health Service, National Institutes of Health, 
Bethesda, MD. (1991)) and/or those residues from a "hypervariable loop" (e.g. residues 26-32 (LI), 50-52 (L2) 
and 91-96 (L3) in the V L , and 26-32 (HI), 53-55 (H2) and 96-101 (H3) in the V H ; Chothia and Lesk J. Mol. 
10 BioL 196:901-917 (1987)). 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 
except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal antibodies 

? ~= are highly specific, being directed against a single antigenic site. Furthermore, in contrast to polyclonal antibody 

preparations which include different antibodies directed against different determinants (epitopes), each 

1 U monoclonal antibody is directed against a single determinant on the antigen. In addition to their specificity, the 

monoclonal antibodies are advantageous in that they may be synthesized uncontaminated by other antibodies. 

yl The modifier "monoclonal" is not to be construed as requiring production of the antibody by any particular 
method. For example, the monoclonal antibodies useful in the present invention may be prepared by the 

|;30 hybridoma methodology first described by Kohler et al., Nature . 256:495 (1975), or may be made using 

■*f recombinant DNA methods in bacterial, eukaryotic animal or plant cells (see, e.g., U.S. Patent No. 4,816,567). 
f; The "monoclonal antibodies" may also be isolated from phage antibody libraries using the techniques described 

13 in Clackson et al., Nature . 352:624-628 (1991) and Marks et al., J. Mol. BioL , 222:581-597 (1991), for 
example. 

25 The monoclonal antibodies herein include "chimeric" antibodies in which a portion of the heavy and/or 

light chain is identical with or homologous to corresponding sequences in antibodies derived from a particular 
species or belonging to a particular antibody class or subclass, while the remainder of the chain(s) is identical 
with or homologous to corresponding sequences in antibodies derived from another species or belonging to 
another antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the desired 

30 biological activity (see U.S. Patent No. 4,816,567; and Morrison et al., Proc. Natl. Acad. Sci. USA . 81:6851- 
6855 (1984)). Chimeric antibodies of interest herein include "primatized" antibodies comprising variable domain 
antigen-binding sequences derived from a non-human primate (e.g. Old World Monkey, Ape etc), and human 
constant region sequences. 

An "intact" antibody is one which comprises an antigen-binding site as well as a C L and at least heavy 

35 chain constant domains, C H 1, C H 2 and C H 3. The constant domains may be native sequence constant domains 
(e.g. human native sequence constant domains) or amino acid sequence variant thereof. Preferably, the intact 
antibody has one or more effector functions. 



"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab') 2 , and Fv 
fragments; diabodies; linear antibodies (see U.S. Patent No. 5,641,870, Example 2; Zapata et al., Protein Eng. 
8(10): 1057-1062 [1995]); single-chain antibody molecules; and multispecific antibodies formed from antibody 
fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 
fragments, and a residual "Fc" fragment, a designation reflecting the ability to crystallize readily. The Fab 
fragment consists of an entire L chain along with the variable region domain of the H chain (V H ), and the first 
constant domain of one heavy chain (C H 1). Each Fab fragment is monovalent with respect to antigen binding, 
i.e., it has a single antigen-binding site. Pepsin treatment of an antibody yields a single large F(ab') 2 fragment 
which roughly corresponds to two disulfide linked Fab fragments having divalent antigen-binding activity and 
is still capable of cross-linking antigen. Fab' fragments differ from Fab fragments by having additional few 
residues at the carboxy terminus of the C H 1 domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab 1 in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab') 2 antibody fragments originally were produced as pairs of Fab' fragments which have 
hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The Fc fragment comprises the carboxy-terminal portions of both H chains held together by disulfides. 
The effector functions of antibodies are determined by sequences in the Fc region, which region is also the part 
recognized by Fc receptors (FcR) found on certain types of cells. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding 
site. This fragment consists of a dimer of one heavy- and one light-chain variable region domain in tight, non- 
covalent association. From the folding of these two domains emanate six hypervariable loops (3 loops each from 
the H and L chain) that contribute the amino acid residues for antigen binding and confer antigen binding 
specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three 
CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the 
entire binding site. 

"Single-chain Fv" also abbreviated as "sFv" or "scFv" are antibody fragments that comprise the V H 
and V L antibody domains connected into a single polypeptide chain. Preferably, the sFv polypeptide further 
comprises a polypeptide linker between the V H and V L domains which enables the sFv to form the desired 
structure for antigen binding. For a review of sFv, see Pluckthun in The Pharmacology of Monoclonal 
Antibodies , vol. 113, Rosenburg and Moore eds., Springer- Verlag, New York, pp. 269-315 (1994); Borrebaeck 
1995, infra. 

The term "diabodies" refers to small antibody fragments prepared by constructing sFv fragments (see 
preceding paragraph) with short linkers (about 5-10 residues) between the V H and V L domains such that inter- 
chain but not intra-chain pairing of the V domains is achieved, resulting in a bivalent fragment, i.e., fragment 
having two antigen-binding sites. Bispecific diabodies are heterodimers of two "crossover" sFv fragments in 
which the V H and V L domains of the two antibodies are present on different polypeptide chains. Diabodies are 
described more fully in, for example, EP 404,097; WO 93/11161; and Hollinger et al., Proc. Natl. Acad. Sci. 



USA, 90:6444-6448 (1993). 

"Humanized" forms of non-human (e.g., rodent) antibodies are chimeric antibodies that contain minimal 
sequence derived from the non-human antibody. For the most part, humanized antibodies are human 
immunoglobulins (recipient antibody) in which residues from a hypervariable region of the recipient are replaced 
by residues from a hypervariable region of a non-human species (donor antibody) such as mouse, rat, rabbit or 
non-human primate having the desired antibody specificity, affinity, and capability. In some instances, 
framework region (FR) residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Furthermore, humanized antibodies may comprise residues that are not found in the recipient antibody 
or in the donor antibody. These modifications are made to further refine antibody performance. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in which 
all or substantially all of the hypervariable loops correspond to those of a non-human immunoglobulin and all 
or substantially all of the FRs are those of a human immunoglobulin sequence. The humanized antibody 
optionally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a 
human immunoglobulin. For further details, see Jones et al., Nature 321:522-525 (1986); Riechmann et al., 
Nature 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. 2:593-596 (1992). 

A "species-dependent antibody," e.g., a mammalian anti-human IgE antibody, is an antibody which 
has a stronger binding affinity for an antigen from a first mammalian species than it has for a homologue of that 
antigen from a second mammalian species. Normally, the species-dependent antibody "bind specifically'' to a 
human antigen (i.e., has a binding affinity (Kd) value of no more than about 1 x 10" 7 M, preferably no more than 
about 1 x 10" s and most preferably no more than about 1 x 10" 9 M) but has a binding affinity for a homologue 
of the antigen from a second non-human mammalian species which is at least about 50 fold, or at least about 500 
fold, or at least about 1000 fold, weaker than its binding affinity for the human antigen. The species-dependent 
antibody can be of any of the various types of antibodies as defined above, but preferably is a humanized or 
human antibody. 

A "TAT binding oligopeptide" is an oligopeptide that binds, preferably specifically, to a TAT 
polypeptide as described herein. TAT binding oligopeptides may be chemically synthesized using known 
oligopeptide synthesis methodology or may be prepared and purified using recombinant technology. TAT 
binding oligopeptides are usually at least about 5 amino acids in length, alternatively at least about 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 
38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59 , 60, 61, 62, 63, 64, 65, 
66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99, or 100 amino acids in length or more, wherein such oligopeptides that are capable of 
binding, preferably specifically, to a TAT polypeptide as described herein. TAT binding oligopeptides may be 
identified without undue experimentation using well known techniques. In this regard, it is noted that techniques 
for screening oligopeptide libraries for oligopeptides that are capable of specifically binding to a polypeptide 
target are well known in the art (see, e.g., U.S. Patent Nos. 5,556,762, 5,750,373, 4,708,871, 4,833,092, 
5,223,409, 5,403,484, 5,571,689, 5,663,143; PCT Publication Nos. WO 84/03506 and WO84/03564; Geysen 
et al., Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002 (1984); Geysen et al., Proc. Natl. Acad. Sci. U.S.A., 
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82:178-182(1985); Geysenetal., in Synthetic Peptides as Antigens, 130-149(1986); GeysenetaL, J. Immunol. 
Meth., 102:259-274 (1987); Schoofs et al., J. Immunol., 140:611-616 (1988), Cwirla, S. E. etal. (1990) Proc. 
Natl. Acad. Sci. USA, 87:6378; Lowman, H.B. et al. (1991) Biochemistry, 30:10832; Clackson, T. et al. 
(1991) Nature, 352: 624; Marks, J. D. et al. (1991), J. Mol. Biol., 222:581; Kang, A.S. et al. (1991) Proc. 
Natl. Acad. Sci. USA, 88:8363, and Smith, G. P. (1991) Current Opin. Biotechnol., 2:668). 

A "TAT binding organic molecule" is an organic molecule other than an oligopeptide or antibody as 
defined herein that binds, preferably specifically, to a TAT polypeptide as described herein. TAT binding 
organic molecules may be identified and chemically synthesized using known methodology (see, e.g., PCT 
Publication Nos. WO00/00823 and WO00/39585). TAT binding organic molecules are usually less than about 
2000 daltons in size, alternatively less than about 1500, 750, 500, 250 or 200 daltons in size, wherein such 
organic molecules that are capable of binding, preferably specifically, to a TAT polypeptide as described herein 
may be identified without undue experimentation using well known techniques. In this regard, it is noted that 
techniques for screening organic molecule libraries for molecules that are capable of binding to a polypeptide 
target are well known in the art (see, e.g., PCT Publication Nos. WO00/00823 and WOOO/39585). 

An antibody, oligopeptide or other organic molecule "which binds" an antigen of interest, e.g. atumor- 
associated polypeptide antigen target, is one that binds the antigen with sufficient affinity such that the antibody, 
oligopeptide or other organic molecule is useful as a diagnostic and/or therapeutic agent in targeting a cell 
expressing the antigen, and does not significantly cross-react with other proteins. In such embodiments, the 
extent of binding of the antibody, oligopeptide or other organic molecule to a "non-target" protein will be less 
than about 10% of the binding of the antibody, oligopeptide or other organic molecule to its particular target 
protein as determined by fluorescence activated cell sorting (FACS) analysis or radioimmunoprecipitation (RIA). 
An antibody, oligopeptide or other organic molecule that "specifically binds to" or is "specific for" a particular 
polypeptide or an epitope on a particular polypeptide is one that binds to that particular polypeptide or epitope 
on a particular polypeptide without substantially binding to any other polypeptide or polypeptide epitope. 

An antibody, oligopeptide or other organic molecule that "inhibits the growth of tumor cells expressing 
a TAT polypeptide" or a "growth inhibitory" antibody, oligopeptide or other organic molecule is one which 
binds to and results in measurable growth inhibition of cancer cells expressing or overexpressing the appropriate 
TAT polypeptide. Preferred growth inhibitory anti-TAT antibodies, oligopeptides or organic molecules inhibit 
growth of TAT-expressing tumor cells by greater than 20 % , preferably from about 20 % to about 50 % , and even 
more preferably, by greater than 50% (e.g., from about 50% to about 100%) as compared to the appropriate 
control, the control typically being tumor cells not treated with the antibody, oligopeptide or other organic 
molecule being tested. In one embodiment, growth inhibition can be measured at an antibody concentration of 
about 0. 1 to 30 /xg/ml or about 0.5 nM to 200 nM in cell culture, where the growth inhibition is determined 1-10 
days after exposure of the tumor cells to the antibody. Growth inhibition of tumor cells in vivo can be 
determined in various ways such as is described in the Experimental Examples section below. The antibody is 
growth inhibitory in vivo if administration of the anti-TAT antibody at about 1 /*g/kg to about 100 mg/kg body 
weight results in reduction in tumor size or tumor cell proliferation within about 5 days to 3 months from the 
first administration of the antibody, preferably within about 5 to 30 days. 
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An antibody, oligopeptide or other organic molecule which "induces apoptosis" is one which induces 
programmed cell death as determined by binding of annexin V, fragmentation of DNA, cell shrinkage, dilation 
of endoplasmic reticulum, cell fragmentation, and/or formation of membrane vesicles (called apoptotic bodies). 
The cell is usually one which overexpresses a TAT polypeptide. Preferably the cell is a tumor cell, e.g., a 
prostate, breast, ovarian, stomach, endometrial, lung, kidney, colon, bladder cell. Various methods are available 
for evaluating the cellular events associated with apoptosis. For example, phosphatidyl serine (PS) translocation 
can be measured by annexin binding; DNA fragmentation can be evaluated through DNA laddering; and 
nuclear/ chromatin condensation along with DNA fragmentation can be evaluated by any increase in hypodiploid 
cells. Preferably, the antibody, oligopeptide or other organic molecule which induces apoptosis is one which 
results in about 2 to 50 fold, preferably about 5 to 50 fold, and most preferably about 10 to 50 fold, induction 
of annexin binding relative to untreated cell in an annexin binding assay. 

Antibody "effector functions" refer to those biological activities attributable to the Fc region (a native 
sequence Fc region or amino acid sequence variant Fc region) of an antibody, and vary with the antibody 
isotype. Examples of antibody effector functions include: Clq binding and complement dependent cytotoxicity; 
Fc receptor binding; antibody-dependent cell-mediated cytotoxicity (ADCC); phagocytosis; down regulation of 
cell surface receptors (e.g., B cell receptor); and B cell activation. 

"Antibody-dependent cell-mediated cytotoxicity" or "ADCC" refers to a form of cytotoxicity in which 
secreted Ig bound onto Fc receptors (FcRs) present on certain cytotoxic cells (e.g., Natural Killer (NK) cells, 
neutrophils, and macrophages) enable these cytotoxic effector cells to bind specifically to an antigen-bearing 
target cell and subsequently kill the target cell with cytotoxins. The antibodies "arm" the cytotoxic cells and are 
absolutely required for such killing. The primary cells for mediating ADCC, NK cells, express FcyRIII only, 
whereas monocytes express FcyRI, FcyRII and FcyRIII. FcR expression on hematopoietic cells is summarized 
in Table 3 on page 464 of Ravetchand Kinet, Annu. Rev. Immunol. 9:457-92 (1991). To assess ADCC activity 
of a molecule of interest, an in vitro ADCC assay, such as that described in US Patent No. 5,500,362 or 
5,821 ,337 may be performed. Useful effector cells for such assays include peripheral blood mononuclear cells 
(PBMC) and Natural Killer (NK) cells. Alternatively, or additionally, ADCC activity of the molecule of interest 
may be assessed in vivo, e.g., in a animal model such as that disclosed in Clynes et al. (USA) 95:652-656 
(1998). 

"Fc receptor" or "FcR" describes a receptor that binds to the Fc region of an antibody. The preferred 
FcR is a native sequence human FcR. Moreover, a preferred FcR is one which binds an IgG antibody (a gamma 
receptor) and includes receptors of the FcyRI, FcyRII and FcyRIII subclasses, including allelic variants and 
alternatively spliced forms of these receptors. FcyRII receptors include FcyRIIA (an "activating receptor") and 
Fey RUB (an "inhibiting receptor"), which have similar amino acid sequences that differ primarily in the 
cytoplasmic domains thereof. Activating receptor FcyRIIA contains an immunoreceptor tyrosine-based 
activation motif (ITAM) in its cytoplasmic domain. Inhibiting receptor Fey RUB contains an immunoreceptor 
tyrosine-based inhibition motif (ITIM) in its cytoplasmic domain, (see review M. in Daeron, Annu. Rev. 
Immunol. 15:203-234 (1997)). FcRs are reviewed in Ravetch and Kinet, Annu. Rev. Immunol. 9:457-492 
(1991); Capel et al. . Immunomethods 4:25-34 (1994); and de Haas et al. , J. Lab. Clin. Med. 126:330-41 (1995). 
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Other FcRs, including those to be identified in the future, are encompassed by the term "FcR" herein. The term 
also includes the neonatal receptor, FcRn, which is responsible for the transfer of maternal IgGs to the fetus 
(Guyer et al., J. Immunol. 117:587 (1976) and Kim et al., J. Immunol. 24:249 (1994)). 

"Human effector cells" are leukocytes which express one or more FcRs and perform effector functions. 
Preferably, the cells express at least FcyRIII and perform ADCC effector function. Examples of human 
leukocytes which mediate ADCC include peripheral blood mononuclear cells (PBMC), natural killer (NK) cells, 
monocytes, cytotoxic T cells and neutrophils; with PBMCs and NK cells being preferred. The effector cells may 
be isolated from a native source, e.g., from blood. 

"Complement dependent cytotoxicity" or "CDC" refers to the lysis of a target cell in the presence of 
complement. Activation of the classical complement pathway is initiated by the binding of the first component 
of the complement system (Clq) to antibodies (of the appropriate subclass) which are bound to their cognate 
antigen. To assess complement activation, a CDC assay, e.g., as described in Gazzano-Santoro et al., JL 
Immunol. Methods 202:163 (1996), may be performed. 

The terms "cancer" and "cancerous" refer to or describe the physiological condition in mammals that 
is typically characterized by unregulated cell growth. Examples of cancer include, but are not limited to, 
carcinoma, lymphoma, blastoma, sarcoma, and leukemia or lymphoid malignancies. More particular examples 
of such cancers include squamous cell cancer (e.g., epithelial squamous cell cancer), lung cancer including small- 
cell lung cancer, non-small cell lung cancer, adenocarcinoma of the lung and squamous carcinoma of the lung, 
cancer of the peritoneum, hepatocellular cancer, gastric or stomach cancer including gastrointestinal cancer, 
pancreatic cancer, glioblastoma, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the 
urinary tract, hepatoma, breast cancer, colon cancer, rectal cancer, colorectal cancer, endometrial or uterine 
carcinoma, salivary gland carcinoma, kidney or renal cancer, prostate cancer, vulval cancer, thyroid cancer, 
hepatic carcinoma, anal carcinoma, penile carcinoma, melanoma, multiple myeloma and B-cell lymphoma, brain, 
as well as head and neck cancer, and associated metastases. 

"Tumor", as used herein, refers to all neoplastic cell growth and proliferation, whether malignant or 
benign, and all pre-cancerous and cancerous cells and tissues. 

An antibody, oligopeptide or other organic molecule which "induces cell death" is one which causes 
a viable cell to become nonviable. The cell is one which expresses a TAT polypeptide, preferably a cell that 
overexpresses a TAT polypeptide as compared to a normal cell of the same tissue type. Preferably, the cell is 
a cancer cell, e.g., a breast, ovarian, stomach, endometrial, salivary gland, lung, kidney, colon, thyroid, 
pancreatic or bladder cell. Cell death in vitro may be determined in the absence of complement and immune 
effector cells to distinguish cell death induced by antibody-dependent cell-mediated cytotoxicity (ADCC) or 
complement dependent cytotoxicity (CDC). Thus, the assay for cell death may be performed using heat 
inactivated serum (i.e. , in the absence of complement) and in the absence of immune effector cells. To determine 
whether the antibody, oligopeptide or other organic molecule is able to induce cell death, loss of membrane 
integrity as evaluated by uptake of propidium iodide (PI), trypan blue (see Moore et al. Cvtotechnology 17:1-11 
(1995)) or 7AAD can be assessed relative to untreated cells. Preferred cell death-inducing antibodies, 
oligopeptides or other organic molecules are those which induce PI uptake in the PI uptake assay in BT474 cells. 
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A "TAT-expressing cell" is a cell which expresses an endogenous or transfected TAT polypeptide on 
the cell surface. A "TAT-expressing cancer" is a cancer comprising cells that have a TAT polypeptide present 
on the cell surface. A "TAT-expressing cancer" produces sufficient levels of TAT polypeptide on the surface 
of cells thereof, such that an anti-TAT antibody, oligopeptide ot other organic molecule can bind thereto and 
have a therapeutic effect with respect to the cancer. A cancer which "overexpresses" a TAT polypeptide is one 
5 which has significantly higher levels of TAT polypeptide at the cell surface thereof, compared to a noncancerous 
cell of the same tissue type. Such overexpression may be caused by gene amplification or by increased 
transcription or translation. TAT polypeptide overexpression may be determined in a diagnostic or prognostic 
assay by evaluating increased levels of the TAT protein present on the surface of a cell (e.g., via an 
immunohistochemistry assay using anti-TAT antibodies prepared against an isolated TAT polypeptide which may 
10 be prepared using recombinant DNA technology from an isolated nucleic acid encoding the TAT polypeptide; 
FACS analysis, etc.). Alternatively, or additionally, one may measure levels of TAT polypeptide-encoding 
nucleic acid or mRNA in the cell, e.g., via fluorescent in situ hybridization using a nucleic acid based probe 
_ corresponding to a TAT-encoding nucleic acid or the complement thereof; (FISH; see W098/45479 published 

ip October, 1998), Southern blotting, Northern blotting, or polymerase chain reaction (PCR) techniques, such as 

; J|5 real time quantitative PCR (RT-PCR). One may also study TAT polypeptide overexpression by measuring shed 
;D antigen in a biological fluid such as serum, e.g, using antibody-based assays (see also, e.g., U.S. Patent No. 

^ 4,933 ,294 issued June 12, 1990; W09 1/05264 published April 18, 1991; U.S. Patent 5,401,638 issued March 

,q 28, 1995; and Sias et al., J. Immunol. Methods 132:73-80 (1990)). Aside from the above assays, various in 

>■ vivo assays are available to the skilled practitioner. For example, one may expose cells within the body of the 

J patient to an antibody which is optionally labeled with a detectable label, e.g. , a radioactive isotope, and binding 

I a* of the antibody to cells in the patient can be evaluated, e.g., by external scanning for radioactivity or by 

; g analyzing a biopsy taken from a patient previously exposed to the antibody. 

i-i As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 

binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
25 constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
30 immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD 
or IgM. 

The word "label" when used herein refers to a detectable compound or composition which is conjugated 
directly or indirectly to the antibody, oligopeptide or other organic molecule so as to generate a "labeled" 
antibody, oligopeptide or other organic molecule. The label may be detectable by itself (e.g. radioisotope labels 
35 or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical alteration of a substrate 
compound or composition which is detectable. 
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The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function 
of cells and/or causes destruction of cells. The term is intended to include radioactive isotopes (e.g., At 211 , 1 131 , 
I 125 , Y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 and radioactive isotopes, of Lu), chemotherapeutic agents e.g. 
methotrexate, adriamicin, vinca alkaloids (vincristine, vinblastine, etoposide), doxorubicin, melphalan, 
mitomycin C, chlorambucil, daunorubicin or other intercalating agents, enzymes and fragments thereof such as 
5 nucleolytic enzymes, antibiotics, and toxins such as small molecule toxins or enzymatically active toxins of 
bacterial, fungal, plant or animal origin, including fragments and/or variants thereof, and the various antitumor 
or anticancer agents disclosed below. Other cytotoxic agents are described below. A tumoricidal agent causes 
destruction of tumor cells. 

A "growth inhibitory agent" when used herein refers to a compound or composition which inhibits 

10 growth of a cell, especially a TAT-expressing cancer cell, either in vitro or in vivo. Thus, the growth inhibitory 
agent may be one which significantly reduces the percentage of TAT-expressing cells in S phase. Examples of 
growth inhibitory agents include agents that block cell cycle progression (at a place other than S phase), such 
as agents that induce Gl arrest and M-phase arrest. Classical M-phase blockers include the vincas (vincristine 

= and vinblastine), taxanes, and topoisomerase II inhibitors such as doxorubicin, epirubicin, daunorubicin, 

-15 etoposide, and bleomycin. Those agents that arrest Gl also spill over into S-phase arrest, for example, DNA 
alkylating agents such as tamoxifen, prednisone, dacarbazine, mechlorethamine, cisplatin, methotrexate, 5- 
fluorouracil, and ara-C. Further information can be found in The Molecular Basis of Cancer . Mendelsohn and 
Israel, eds., Chapter 1, entitled "Cell cycle regulation, oncogenes, and antineoplastic drugs" by Murakami et 
al. (WB Saunders: Philadelphia, 1995), especially p. 13. The taxanes (paclitaxel and docetaxel) are anticancer 

20 drugs both derived from the yew tree. Docetaxel (TAXOTERE®, Rhone-Poulenc Rorer), derived from the 
European yew, is a semisynthetic analogue of paclitaxel (TAXOL®, Bristol-Myers Squibb). Paclitaxel and 
docetaxel promote the assembly of microtubules from tubulin dimers and stabilize microtubules by preventing 
depolymerization, which results in the inhibition of mitosis in cells. 

"Doxorubicin" is an anthracycline antibiotic. The full chemical name of doxorubicin is (8S-cis)-10-[(3- 

25 amino-2,3,6-trideoxy-a-L-lyxo-hexapyranosyl)oxy]-7,8,9,10-tetrahydro-6,8,ll-trihydroxy-8-(hy(koxyace 
methoxy-5 , 12-naphfhacenedione . 

The term "cytokine" is a generic term for proteins released by one cell population which act on another 
cell as intercellular mediators. Examples of such cytokines are lymphokines, monokines, and traditional 
polypeptide hormones. Included among the cytokines are growth hormone such as human growth hormone, N- 

30 methionyl human growth hormone, and bovine growth hormone; parathyroid hormone; thyroxine; insulin; 
proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), thyroid 
stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; fibroblast growth factor; 
prolactin; placental lactogen; tumor necrosis factor-a and -P; mullerian-inhibiting substance; mouse 
gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; integrin; fhrombopoietin 

35 (TPO); nerve growth factors such as NGF-p; platelet-growth factor; transforming growth factors (TGFs) such 
as TGF-cc and TGF-P; insulin-like growth factor-I and -II; erythropoietin (EPO); osteoinductive factors; 
interferons such as interferon -a, -p, and -y; colony stimulating factors (CSFs) such as macrophage-CSF (M- 
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CSF); granulocyte-macrophage-CSF (GM-CSF); and granulocyte-CSF (G-CSF); interleukins (ILs) such as IL-1 , 
IL- la, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-11, IL-12; a tumor necrosis factor such as TNF-a or 
TNF-J3; and other polypeptide factors including LIF and kit ligand (KL). As used herein, the term cytokine 
includes proteins from natural sources or from recombinant cell culture and biologically active equivalents of 
the native sequence cytokines. 

The term "package insert" is used to refer to instructions customarily included in commercial packages 
of therapeutic products, that contain information about the indications, usage, dosage, administration, 
contraindications and/or warnings concerning the use of such therapeutic products. 
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Table 1 



/* 

* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is _M; stop-stop = 0; J (joker) match = 0 

*/ 

#define _M -8 /* value of a match with a stop */ 

int _day[26][26] = { 

/* ABCDEFGHIJKLMNOPQRSTUVWXYZ*/ 

/* A */ { 2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0,_M, 1, 0,-2, 1, 1, 0, 0,-6, 0,-3, 0}, 

/* B */ { 0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2,_M,-1, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1}, 

/* C */ {-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4,_M,-3,-5,-4, 0,-2, 0,-2,-8, 0, 0,-5}, 

/* D */ { 0, 3,-5, 4, 3,-6, 1, 1,-2, 0, 0,-4,-3, 2,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 2}, 

/* E */ { 0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 3}, 

/* F */ {-4,-5,-4,-6,-5, 9,-5,-2, 1, 0,-5, 2, 0,-4,_M,-5,-5,-4,-3,-3, 0,-1, 0, 0, 7,-5}, 

/* G */ { 1, 0,-3, 1, 0,-5, 5,-2,-3, 0,-2,-4,-3, 0,_M,-l,-l,-3, 1, 0, 0,-1,-7, 0,-5, 0}, 

/* H */ {-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2,_M, 0, 3, 2,-1,-1, 0,-2,-3, 0, 0, 2}, 

/* I */ {-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2,_M,-2,-2,-2,-l, 0, 0, 4,-5, 0,-1,-2}, 

/* J */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* K */ {-1, 0,-5, 0, 0,-5,-2, 0,-2, 0, 5,-3, 0, 1,_M,-1, 1, 3, 0, 0, 0,-2,-3, 0,-4, 0}, 

/* L */ {-2,-3,-6,-4,-3, 2,-4,-2, 2, 0,-3, 6, 4,-3,_M,-3,-2,-3,-3,-l, 0, 2,-2, 0,-1,-2}, 

/* M */ {-l,-2,-5,-3',-2, 0,-3,-2, 2, 0, 0, 4, 6,-2,_M,-2,-l, 0,-2,-1, 0, 2,-4, 0,-2,-1}, 

/* N */ { 0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1,-3,-2, 2,_M,-1, 1, 0, 1, 0, 0,-2,-4, 0,-2, 1}, 

/* O */ {_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M, 0,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M}, 

/* P */ { 1,-1,-3,-1,-1,-5,-1, 0,-2, 0,-l,-3,-2,-l,_M, 6, 0, 0, 1, 0, 0,-1,-6, 0,-5, 0}, 

/* Q */ { 0, 1,-5, 2, 2,-5,-1, 3,-2, 0, 1,-2,-1, 1,_M, 0, 4, 1,-1,-1, 0,-2,-5, 0,-4, 3}, 

/* R */ {-2, 0,-4,-1,-1,-4,-3, 2,-2, 0, 3,-3, 0, 0,_M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0}, 

/* S */ { 1, 0, 0, 0, 0,-3, 1,-1,-1, 0, 0,-3,-2, 1,_M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0}, 

/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0}, 

/* U */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* V */ { 0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2,-2,_M,-l,-2,-2,-l, 0, 0, 4,-6, 0,-2,-2}, 

/* W */ {-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3,-2,-4,-4,_M,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6}, 

/* X */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* Y */ {-3,-3, 0,-4,-4, 7,-5, 0,-1, 0,-4,-l,-2,-2,_M,-5,-4,-4,-3,-3, 0,-2, 0, 0,10,-4}, 

/* Z */ { 0, 1,-5, 2, 3,-5, 0, 2,-2, 0, 0,-2,-1, 1,_M, 0, 3, 0, 0, 0, 0,-2,-6, 0,-4, 4} 

>; 
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#include <stdio.h> 






^include <ctype.h> 






#define MAXJMP 


16 


/* 


#define MAXGAP 


24 


/* 


#define JMPS 


1024 


/* 


#define MX 






#define DMAT 


3 


/* 


#define DMIS 


0 


/* 


#define DINSO 


8 


/* 


#define DINS1 




/* 


#define PINSO 


8 


/* 


#define PINS1 


4 


/* 


struct jmp { 







struct diag { 



short 
unsigned short 



long 
short 
struct jmp 



n[MAXJMP]; /* size of jmp (neg for dely) */ 
x[MAXJMP]; /* base no. of jmp in seq x */ 
/* limits seq to 2" 16 -1 */ 



score; 
offset; 
ijmp; 



/* score at last jmp */ 
/* offset of prev block */ 
/* current jmp index */ 
/* list of jmps */ 



spc; /* number of leading spaces *i 

n[JMPS];/* size of jmp (gap) */ 

x[JMPS];/* loc of jmp (last elem before gap) */ 



char 



long 

struct 

struct 

char 
char 



diag 
path 



*ofile; 

*namex[2]; 

*prog; 

*seqx[2]; 

dmax; 

dmaxO; 

dna; 

endgaps; 
gapx, gapy; 
lenO, lenl; 
ngapx, ngapy; 
smax; 
*xbm; 
offset; 



*dx; 
PP[2]; 



/* output file name */ 

/* seq names: getseqs() */ 

/* prog name for err msgs */ 

/* seqs: getseqs() */ 

/* best diag: nw() */ 

/* final diag */ 

/* set if dna: main() */ 

/* set if penalizing end gaps */ 

/* total gaps in seqs */ 

/* seq lens */ 

/* total size of gaps */ 

/* max score: nw() */ 

/* bitmap for matching */ 

/* current offset in jmp file */ 

/* holds diagonals */ 

/* holds path for seqs */ 



*calloc(), *malloc(), *index(), *strcpy(); 
*g_calloc(); 



38 



Table 1 (conf) 

/* Needleman-Wunsch alignment program 

* usage: progs filel file2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with ' > ' or ' < ' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align.out" 

* The program may create a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 

*/ 

^include "nw.h" 
#include "day.h" 

static _dbval[26] = { 

1,14,2,13,0,0,4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0 

}; 

static _pbval[26] = { 

1, 2|(1< <( , D'-'A , ))|(1< <('N , -'A')), 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1< < 10, 1<<11, 1< < 12, 1< < 13, 1<<14, 
1< <15, 1< <16, 1< <17, 1< <18, 1< <19, 1< <20, 1< <21, 1< « 
1<<23, 1<<24, K^SKK^'E'-'A'^KK^'Q'-'A')) 



char *av[]; 

prog = av[0]; 
if (ac ! = 3) { 

fprintf(stderr, "usage: %s filel file2\n", prog); 

fprintf(stderr, "where filel and file2 are two dna or two protein sequences. \n"); 
fprintf(stderr, "The sequences can be in upper- or lower-case\n"); 
fprintf(stderr, "Any lines beginning with ';' or 1 < ' are ignored\n"); 
fprintf(stderr, "Output is in the file \"align.out\"\n"); 
exit(l); 

> 

namex[0] = av[l]; 

namex[l] = av[2]; 

seqx[0] = getseq(namex[0], &len0); 

seqxfl] = getseq(namex[l], &lenl); 

xbm = (dna)? _dbval : _pbval; 

endgaps = 0; /* 1 to penalize endgaps */ 

ofile = "align.out"; /* output file */ 

nw(); /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

print(); /* print stats, alignment */ 

cleanup(O); /* unlink any tmp files */ 



39 



Table 1 (conf) 

/* do the alignment, return best score: main() 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 
5 * a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
*/ 

nw() 

10 char *px, *py; /* seqs and ptrs */ 

hit *ndely, *dely; /* keep track of dely */ 

int ndelx, delx; /* keep track of delx */ 

int *tmp; /* for swapping rowO, rowl */ 

int mis; /* score for each type */ 

15 int insO, insl; /* insertion penalties */ 

register id; /* diagonal index */ 

register ij; /*jmp index*/ 

register *col0, *coll; /* score for curr, last row */ 

register xx, yy; /* index into seqs */ 

20 

Q dx = (struct diag *)g_calloc("to get diags", lenO+lenl + 1, sizeof(struct diag)); 

; f| ndely = (int *)g_calloc("to get ndely", lenl + 1, sizeof(int)); 

iT= dely = (int *)g_calloc("to get dely", Ienl + 1, sizeof(int)); 

; ,$5 colO = (int *)g_calloc("to get colO", Ienl + 1, sizeof(int)); 

' :yi coll = (int *)g_calloc("to get coll", lenl + 1, sizeof(int)); 

; \! insO = (dna)? DINSO : PESfSO; 

i ri insl = (dna)? DINS1 : PINS1; 

'30 smax = -10000; 

; a=. if (endgaps) { 

]S for (col0[0] = dely[0] = -insO, yy = 1; yy < = lenl; yy+ +) { 

col0[yy] = delyfyy] = col0[yy-l] - insl; 
ndely [yy] = yy; 

35 } 

= col0[0] = 0; /* Waterman Bull Math Biol 84 */ 

II > 

else 

for (yy = 1; yy < = lenl; yy-f- +) 
40 dely[yy] = -insO; 

/* fill in match matrix 

*/ 

for (px = seqx[0], xx = 1; xx < = lenO; px+ +, xx+ +) { 
45 /* initialize first entry in col 

*/ 

if (endgaps) { 

if (xx = = 1) 

coll[0] = delx = -(insO+insl); 



} 

else { 



coll[0] = delx = co!0[0] - insl; 
ndelx = xx; 



coll[0] = 0; 
delx = -insO; 
ndelx = 0; 
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for (py = seqx[l], yy = 1; yy < = lenl; py++, yy+ +) { 
mis = colO[yy-l]; 
if (dna) 

5 mis + = (xbm[*px-' A']&xbm[*py-'A'])? DMAT : DMIS; 

else 

mis + = _day[*px-'A'][*py-'A']; 

/* update penalty for del in x seq; 
10 * favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 

*/ 

if (endgaps | | ndely[yy] < MAXGAP) { 

if (col0[yy] - insO > = dely[yy]) { 
15 delyfyy] = col0[yy] - (insO + insl); 

ndely[yy] = 1; 

} else { 

dely[yy] -= insl; 
ndely[y y] + + ; 

20 } 

} else { 

^ if (col0[yy] - (insO + insl) > = dely[yy]) { 

delyfyy] = colO[yy] - (insO+insl); 
" ndely[yy] = 1; 

25 } else 

ndelyfyy] + + ; 

*: } 

/* update penalty for del in y seq; 
30 * favor new del over ongong del 

*/ 

7 if (endgaps | | ndelx < MAXGAP) { 

- if (coll[yy-l] - insO > = delx) { 

delx = coll[yy-l] - (insO+insl); 
35 ndelx = l; 

" } else { 

! delx -= insl; 

ndelx+ + ; 

40 } else { 

if (coll[yy-l] - (insO+insl) > = delx) { 
delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

} else 

45 ndelx++; 

} 

/* pick the maximum score; we're favoring 

* mis over any del and delx over dely 
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id = xx - yy + lenl - 1 ; 
if (mis > = delx && mis > = dely[yy]) 
coIl[yy] = mis; 

5 else if (delx > = dely[yy]) { 

collfyy] = delx; 
ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna | | (ndelx > = MAXJMP 
&&xx > dx[id].jp.x[ij]+MX) | | mis > dx[id]. score +DINSO)) { 
10 dx[id].ijmp+ + ; 

if (++ij >= MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp = 0; 
dx[id] .offset = offset; 

15 offset += sizeof (struct jmp) + sizeof(offset); 

} 

} 

dx[id].jp.n[ij] = ndelx; 
dx[id].jp.x[ij] = xx ! 

20 dx[id]. score = delx; 

} 

else { 

coll[yy] = delyfyy]; 
ij = dx[id].ijmp; 

25 if (dx[id].jp.n[0] && (!dna j | (ndelyfyy] > = MAXJMP 

&&xx > dx[id].jp.x[ij] + MX) | | mis > dx [id]. score +DINS0)) { 
dx[id].ijmp+ + ; 
if (+ +ij > = MAXJMP) { 
writejmps(id); 

^0 ij = dx[id].ijmp = 0; 

dx[id]. offset = offset; 

offset += sizeof(struct jmp) + sizeof (offset); 

r } } 

"35 dx[id].jp.n[ij] = -ndelyfyy]; 

dx[id].jp.x[ij] = xx; 
dx[id]. score = delyfyy]; 

} 

if (xx = = lenO && yy < lenl) { 
40 /* last col 

*/ 

if (endgaps) 

collfyy] -= ins0 + insl*(lenl-yy); 
if (collfyy] > smax) { 
45 smax = collfyy]; 

dmax = id; 

} 

} 

50 if (endgaps && xx < lenO) 

coll[yy-l] -= ins0+insl*(len0-xx); 
if (coll[yy-l] > smax) { 

smax = coll[yy-l]; 
dmax = id; 

55 } 

tmp = colO; colO = coll; coll = tmp; 

} 

(void) free((char *)ndely); 
(void) free((char *)dely); 
60 (void) free((char *)co!0); 

(void) free((char *)coll); 
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;»5 
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* printO - only routine visible outside this module 

* static: 

* getmatO -- trace back best path, count matches: print() 

* pr_align() - print alignment of described in array p[]: print() 

* dumpblockO ~ dump a block of lines with numbers, stars: pr_align() 

* numsO - put out a number line: dumpblock() 

* pudine() - put out a line (name, [num], seq, [num]): dumpblockO 

* stars() - -put a line of stars: dumpblockO 

* stripnameO — strip any path and prefix from a seqname 
*/ 

^include "nw.h" 



#define SPC 3 
#define P LINE 256 
#define P SPC 3 

extern _day[26][26]; 
int olen; 
FILE *fx; 



m output line */ 
/* space between name or num a 



/* set output line length */ 
/* output file */ 



print() 
{ 



print 



int 

if ((fx 
} 



lx, ly, firstgap, lastgap; /* overlap */ 



fopen(ofile, "w")) = 
fprintf(stderr,"%s: < 
cleanup(l); 



= 0){ 

n't write %s\n", prog, ofile); 



fprintf(fx, "<first sequence: %s (length = %d)\n", namex[0], lenO); 
fprintf(fx, "< second sequence: %s (length = %d)\n", namex[l], Ienl); 
olen = 60; 
lx = lenO; 
ly = lenl; 

firstgap = lastgap = 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 
pp[0].spc = firstgap = lenl - dmax - 1; 
ly-= pp[0].spc; 

} 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l].spc = firstgap = dmax - (lenl - 1); 
lx -= pp[l].spc; 

} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1; 
lx -= lastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap = dmaxO - (lenO - 1); 
ly -= lastgap; 

> 

getmat(lx, ly, firstgap, lastgap); 
pr_align(); 



60 
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* trace back the best path, count matches 



getmatQx, ly, firstgap, lastgap) 

int lx, ly; /* "core" (minus endgaps) */ 

int firstgap, lastgap; /* leading trailing overlap */ 

{ 

int ran, iO, il, sizO, sizl; 

char outx[32]; 
double pet; 
register nO, nl; 

register char *p0, *pl; 

/* get total matches, score 
*/ 

iO = il = sizO = sizl = 0; 
pO = seqx[0] + pp[l].spc; 
pi = seqx[l] + pp[0].spc; 
nO = pp[l].spc + 1; 
nl = pp[0].spc + 1; 



nm = 0; 

while ( *pO && *pl ) { 
if (sizO) { 

P l + + ; 
nl + + ; 
sizO--; 

} 

else if (sizl) { 

p0++; 
n0++; 
sizl--; 

} 

else { 

if (xbm[*pO-'A']&xbm[*pl-'A']) 

nm+ +; 
if (n0+ + == pp[0].x[iO]) 

sizO = pp[0].n[iO++]; 
if(nl + + ==pp[l].x[il]) 

sizl = pp[l].n[il + +]; 

p0++; 

pi + +; 

} 

} 

/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 

*/ 

if (endgaps) 

lx = (lenO < lenl)? lenO : lenl; 

else 

lx = (lx < ly)? lx : ly; 
pet = 100.*(double)nm/(doiible)lx; 
rprintf(fx, "\n"); 

fprintf(fx, " < %d match%s in an overlap of %d: % .2f percent similarity\n 
nm, (nm == 1)? "" : "es", lx, pet); 
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15 



fprintf(fx, "<gaps in first sequence: %d", gapx); 
if (gapx) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapx, (dna)? "base": "residue", (ngapx == 1)? "":"s"); 

fprintf(fx,"%s", outx); 

fprintf(fx, ", gaps in second sequence: %d", gapy); 
if (gapy) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapy, (dna)? "base": "residue", (ngapy == 1)? "":"s"); 
fprintf(fx,"%s", outx); 



...getmat 



} 

if (dna) 



fprintf(fx, 

"Vn< score: %d (match = %d, mismatch = 
smax, DMAT, DMIS, DINSO, DINS1); 



>d, gap penalty = %d + %d per base)\n", 



fprintf(fx, 

"\n< score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per res 
smax, PINSO, PINS1); 
if (endgaps) 

fprintf(fx, 

"< endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (frrstgap == 1)? "" : "s", 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? "" : "s"); 

fprintf(fx, "< endgaps not penalized\n"); 



static nm; /* matches in core — for checking */ 

static lmax; /* lengths of stripped file names */ 

static ij[2]; /* jmp index for a path */ 

static nc[2]; /* number at start of current line */ 

static ni[2]; /* current elem number — for gapping */ 

static siz[2]; 

static char *ps[2]; /* ptr to current element */ 

static char *po[2]; /* ptr to next output char slot */ 

static char out[2][P_LINE]; /* output line */ 

static char star[P_LINE]; /* set by stars() */ 

/* 

* print alignment of described in struct path ppfl 

*/ 

static 

pr_align() 
{ 

int nn; /* char count */ 



pralign 



for (i = 0, lmax = 0; i < 2; i++) { 
nn = stripname(namex[i]); 
if (nn > lmax) 

lmax = nn; 

nc[i] = 1; 
ni[i] = 1; 
siz[i] = ij[i] = 0; 
ps[i] = seqx[i]; 
po[i] = out[i]; 
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10 



■30 



50 } 



for (nn = nm = 0, more = 1; more; ) { ...pr_align 

for (i = more = 0; i < 2; i+ +) { ~~ 

/* 

* do we have more of this sequence? 

*/ 

if(!*ps[i]) 

continue; 



if (pp[i].spc) { /* leading space */ 
*po[i] + + = ' '; 
pp[i].spc~; 

} 

else if (siz[i]) { /* in a gap */ 
*po[i]++ = '-'; 
siz[i]-; 

} 

else { /* we're putting a seq element 

*/ 

*po[i] = *ps[i]; 
if (islower(*ps[i])) 

*ps[i] = toupper(*ps[i]); 

po[i] + +; 
ps[i]+ + ; 



* are we at next gap for this seq? 

*/ 

if(ni[i] ==pp[i].x[ij[i]]){ 

/* 

* we need to merge all gaps 

* at this location 

*/ 

siz[i] = pp[ij.n[ij[i] + +]; 
whfle(ni[i] ==pp[i].x[ij[i]]) 

siz[i] +=pp[i]-n[ij[i] + +]; 

} 

ni[i] + + ; 



} 



} 

if (+ +nn = = olen [ | !more && nn) { 
dumpblockO; 
for (i = 0; i < 2; i++) 
po[i] = out[i]; 

nn = 0; 

} 



* dump a block of lines, including numbers, stars: pr_align() 

*/ 

55 static 

dumpblockO dumpblock 

{ 

register i; 

60 for (i = 0; i < 2; i+ +) 

*po[i]- = '\0'; 
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(void) putc('\n', fx); 

for (i = 0; i < 2; i++) { 

if (*out[i] && (*out[i] != " | | *(pc 

if a ==o) 

nums(i); 
if (i = = 0&& *out[l]) 
stars(); 

putline(i); 

if (i == 0&& *out[l]) 
fprintf(fx, star); 

if Ci == 1) 

nums(i); 



..dumpblock 



* put ( 

*/ 

static 



it a number line: dumpblock() 



v 35 



int ix; /* index in out[] holding seq line */ 

char nline[P_LINE]; 

register i, j; 

register char *pn, *px, *py; 

for (pn = nline, i = 0; i < lmax+PSPC; i+ +, pn+ +) 
*pn = 1 '; 

for (i = nc[ix], py = out[ix]; *py; py+ +, pn+ +) { 

if (*py = = ' ' | | *py == ._•) 



*pn = ; 

if (i%10 = = 0 | | (i = = 1 &&nc[ix] != 1)) { 
j = (i < 0)? -i : i; 
for (px = pn; j; j /= 10, px— ) 
*px = j%10 + '0'; 

if (i < 0) 

*px = '-'; 

} 

else 



} 

} 

*pn = '\0'; 
nc[ix] = i; 

for (pn = nline; *pn; pn+ +) 
(void) putc(*pn, fx); 
(void) putc('\n', fx); 



[num], seq, [num]): dumpblock() 



pudine(ix) 



putline 
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..putline 



register char *px; 

for (px = namex[ix], i = 0; *px && *px ! = ' : ' ; px+ + 

(void) putc(*px, fx); 
for (; i < lmax+P_SPC; i+ + ) 

(void) putc(' ', fx); 

/* these count from 1 : 

* ni[] is current element (from 1) 

* nc[] is number at start of current line 

*/ 

for (px = out[ix]; *px; px++) 

(void) putc(*px&0x7F, fx); 
(void) putc('\n', fx); 



* put a line of stars (seqs always in out[0], out[l]): dumpblock() 



25 stars() 
{ 



register char 



-30 if (!*out[0] | | (*out[0] == " && *(po[0]) = = ' ') | | 

!*out[l] | [ (*out[l] =="&& *( P o[l]) ==")) 

; px = star; 

] for (i = lmax+PSPC; i; i--) 

35 *px+ + = ' '; 

=- for (pO = out[0], pi = out[l]; *p0 && *pl; p0+ + , pi + +) { 

if (isalpha(*pO) && isalpha(*pl)) { 

40 if (xbmPpO-'A'J&xbmf^l-'A*]) { 

nm+ +; 

} 

else if (!dna && _day[*pO-'A'][*pl-'A'] > 0) 



*px++ = cx; 

} 

*px++ = '\n'; 
*px = '\0'; 



60 
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/* 

* strip path or prefix from pn, return len: pr_align() 

*/ 

static 

stripname(pn) Stripname 
char *pn; /* file name (may be path) */ 

{ 

register char *px, *py; 
py = 0; 

for (px = pn; *px; px++) 
if (* px == •/.) 

py = px + 1; 

if(py) 

(void) strcpy(pn, py); 
return(strlen(pn)) ; 



20 
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* cleanupO — cleanup any tmp file 

* getseqO — read in seq, set dna, len, maxlen 

* g_calloc() — calloc() with error checkin 

* readjmpsO - get the good jmps, from tmp file if necessary 

* writejmpsO — write a filled array of jmps to a tmp file: nw() 
*/ 

#include "nw.h" 
#include <sys/file.h> 



char 

FILE 



*jname = 7tmp/homgXXXXXX"; 
*fj; 



int cleanupO; 
long lseek(); 



/* tmp file for jmps */ 
/* cleanup tmp file */ 



=!?5 



* remove any tmp file if we blow 

*/ 

cleanup(i) 

int i; 

{ 

if(fj) 

(void) unlink(jname); 

exit(i); 

} 



cleanup 



* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with ' < ', or ' > ' 

* seq in upper or lower case 



:? 5 
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getseq(file, len) 
{ 



char 


•file; 


/* file name */ 


int 


*len; 


/* seq len */ 


char 




line[1024], *ps< 


register 


char 


*px, *py; 


int 




natgc, tlen; 


FILE 




*fp; 


if((fp = 


fopen(file 


S ,V)) ==0){ 



getseq 



[ | *line =='>') 



fprintf(stderr,"%s: can't read %s\n", prog, file); 
exit(l); 

> 

tlen = natgc = 0; 
while (fgets(line, 1024, fp)) { 

if (*line == ';' | | *line = 

continue; 
for (px = line; *px != '\n'; px++) 

if (isupper(*px) | | islower(*px)) 
tlen++; 

} 

if ((pseq = malloc((unsigned)(tlen+6))) == 0) { 

fprintf(stderr, " %s: malloc() failed to get %d bytes for %s\n", prog, tlen+6, file); 
exit(l); 

} 

pseq[0] = pseq[l] = pseq[2] = pseq[3] = '\0'; 
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py = pseq + 4; 
*len = tlen; 
rewind(fp); 

while (fgets(line, 1024, fp)) { 

if (*line == ';' | | *line == '<' | | *line =='>') 
continue; 

for (px = line; *px ! = '\n'; px+ +) { 
10 if (isupper(*px)) 

*py++ = *px; 
else if (islower(*px)) 

*py++ = toupper(*px); 
if (index{ "ATGCU" , *(py- 1 ))) 
15 natgc++; 
} 

} 

*py++ = '\0'; 

*py = '\0'; 
20 (void) fclose(fp); 

dna = natgc > (tIen/3); 
T return(pseq+4); 
t } 

"25 char 

= g_cal!oc(msg, nx, sz) g_caIloC 



msg, in 


sz) 




char 


*msg; 


/* program, calling routine */ 


int 


nx, sz; 


/* number and size of elements */ 


char 




*px, *calloc(); 


if((px 


= calloc((unsigned)nx, (unsigned)sz)) = = 0) { 



30 

: if((px = 

if(*msg){ 

fprintf(stderr, "%s: g_calloc() failed %s (n=%d, sz = %d)\n", prog, msg, nx, sz); 
35 exit(l); 
I } 
= } 
- return(px); 

40 » 

/* 

* get final jmps from dx[] or tmp file, set pp[], reset dmax: main() 

*/ 

readjmpso readimps 
45 { 

int fd = -1; 



if(fj){ 

(void) fclose(fj); 

if ((fd = open(jname, O RDONLY, 0)) < 0) { 

fprintf(stderr, "%s: can't open() %s\n", prog, jname); 
cleanup(l); 

} 

} 

for (i = iO = il = 0, dmaxO = dmax, xx = lenO; ; i+ +) { 
while (1) { 

for (j = dx[dmax].ijmp; j > = 0 && dx[dmax].jp.x[j] > = xx; j-) 
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ifts 



Table 1 (conf) 

...readjmps 

if (j < 0 && dx[dmax]. offset && fj) { 

(void) lseek(fd, dx[dmax]. offset, 0); 

(void) read(fd, (char *)&dx[dmax].jp, sizeof(struct jmp)); 

(void) read(fd, (char *)&dx[dmax]. offset, sizeof(dx[dmax]. offset)); 

dx[dmax].ijmp = MAXJMP-1; 

} 

else 

break; 

} 

if (i > = JMPS) { 

fprintf(stderr, " %s: too many gaps in alignment\n", prog); 
cleanup(l); 

} 

if(j > = 0){ 

siz = dx[dmax].jp.n[j]; 
xx = dx[dmax].jp.x[j]; 
dmax + = siz; 

if (siz < 0) { /* gap in second seq */ 

pp[l].n[il] = -siz; 
xx + = siz; 

/* id = xx - yy + lenl - 1 

*/ 

pp[l].x[il] = xx - dmax + lenl - 1; 
gapy++; 
ngapy -= siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (-siz < MAXGAP | | endgaps)? -siz : MAXGAP; 
il + +; 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].n[i0] = siz; 
pp[0].x[i0] = xx; 
gapx+ +; 
ngapx += siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
i0++; 

} 

> 

else 

break; 

} 

/* reverse the order of jmps 

*/ 

for (j = 0, i0~; j < iO; j + + , i0~) { 

i = pp[0].n[j]; pp[0].n[j] = pp[0].n[i0]; pp[0].n[i0] = i; 
i = pp[0].x[j]; pp[0].x[j] = pp[0].x[i0]; pp[0].x[i0] = i; 

> 

for(j =0, il-;j < il;j + + ,il-){ 

i = pp[l].n[j]; pp[l].n|j] = pp[l].n[il]; pp[l].n[il] = i; 
i = pp[l].x[j] ; pp[l].x[j] = pp[l].x[il]; pp[l].x[il] = i; 

> 

if (fd > = 0) 

(void) close(fd); 

if(tj){ 

(void) unlink(jname); 
fj =0; 
offset = 0; 
} } 
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Table 1 (conf) 

/* 

* write a filled jmp struct offset of the prev one (if any): nw() 

*/ 

writejmps(ix) WritejmpS 
int ix; 

{ 

char *mktemp(); 
if(!fj){ 

if (mktemp(jname) < 0) { 

fprintf(stderr, "%s: can't mktempO %s\n", prog, jname); 
cleanup(l); 

} 

if ((fj = fopen(jname, "w")) = = 0) { 

fprintf(stderr, "%s: can't write %s\n", prog, jname); 
exit(l); 

} 

} 

(void) fwrite((char *)&dx[ix].jp, sizeof(struct jmp), 1, fj); 
(void) fwrite((char *)&dx[ix]. offset, sizeof(dx[ix]. offset), 1, fj); 
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Table 2 



TAT XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

5 % amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
by ALIGN-2) divided by (the total number of amino acid residues of the TAT polypeptide) = 

10 5 divided by 15 = 33.3% 

Table 3 

% TAT XXXXXXXXXX (Length = 10 amino acids) 

JlJ-5 Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

I II % amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
jllO by ALIGN-2) divided by (the total number of amino acid residues of the TAT polypeptide) = 

(J 5 divided by 10 = 50% 

Table 4 

25 

TAT-DNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length = 16 nucleotides) 

% nucleic acid sequence identity = 

30 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the TAT-DNA nucleic acid sequence) = 

6 divided by 14 = 42.9% 
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Table 5 



TAT-DNA NNNNNNNNNNNN (Length = 12 nucleotides) 

Comparison DNA NNNNLLLVV (Length = 9 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the TAT-DNA nucleic acid sequence) = 

4 divided by 12 = 33.3% 

IL Compositions and Methods of the Invention 

A. Anti-TAT Antibodies 

In one embodiment, the present invention provides anti-TAT antibodies which may find use herein as 
therapeutic and/or diagnostic agents. Exemplary antibodies include polyclonal, monoclonal, humanized, 
bispecific, and heteroconjugate antibodies. 

1. Polyclonal Antibodies 

Polyclonal antibodies are preferably raised in animals by multiple subcutaneous (sc) or intraperitoneal 
(ip) injections of the relevant antigen and an adjuvant. It may be useful to conjugate the relevant antigen 
(especially when synthetic peptides are used) to a protein that is immunogenic in the species to be immunized. 
For example, the antigen can be conjugated to keyhole limpet hemocyanin (KLH), serum albumin, bovine 
thyroglobulin, or soybean trypsin inhibitor, using a bifunctional or derivatizing agent, e.g., maleimidobenzoyl 
sulfosuccinimide ester (conjugation through cysteine residues), N-hydroxysuccinimide (through lysine residues), 
glutaraldehyde, succinic anhydride, SOCl 2 , or R 1 N=C=NR, where R and R 1 are different alkyl groups. 

Animals are immunized against the antigen, immunogenic conjugates, or derivatives by combining, e.g. , 
100 jug or 5 y,g of the protein or conjugate (for rabbits or mice, respectively) with 3 volumes of Freund's 
complete adjuvant and injecting the solution intradermally at multiple sites. One month later, the animals are 
boosted with 1/5 to 1/10 the original amount of peptide or conjugate in Freund's complete adjuvant by 
subcutaneous injection at multiple sites. Seven to 14 days later, the animals are bled and the serum is assayed 
for antibody titer. Animals are boosted until the titer plateaus. Conjugates also can be made in recombinant cell 
culture as protein fusions. Also, aggregating agents such as alum are suitably used to enhance the immune 
response. 

2. Monoclonal Antibodies 

Monoclonal antibodies may be made using the hybridoma method first described by Kohler et al., 
Nature, 256:495 (1975), or may be made by recombinant DNA methods (U.S. Patent No. 4,816,567). 
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In the hybridoma method, a mouse or other appropriate host animal, such as a hamster, is immunized 
as described above to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the protein used for immunization. Alternatively, lymphocytes may be immunized in vitro. After 
immunization, lymphocytes are isolated and then fused with a myeloma cell line using a suitable fusing agent, 
such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice . 
5 pp. 59-103 (Academic Press, 1986)). 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium which medium 
preferably contains one or more substances that inhibit the growth or survival of the unfused, parental myeloma 
cells (also referred to as fusion partner). For example, if the parental myeloma cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the selective culture medium for the 
10 hybridomas typically will include hypoxanthine, aminopterin, and thymidine (HAT medium), which substances 
prevent the growth of HGPRT-deficient cells. 

Preferred fusion partner myeloma cells are those that fuse efficiently, support stable high-level 
production of antibody by the selected antibody-producing cells, and are sensitive to a selective medium that 
s q selects against the unfused parental cells. Preferred myeloma cell lines are murine myeloma lines, such as those 

^1.5 derived from MOPC-21 and MPC-1 1 mouse tumors available from the Salk Institute Cell Distribution Center, 
'.fi San Diego, California USA, and SP-2 and derivatives e.g., X63-Ag8-653 cells available from the American 

-j Type Culture Collection, Manassas, Virginia, USA. Human myeloma and mouse-human heteromyeloma cell 

;~ lines also have been described for the production of human monoclonal antibodies (Kozbor, J. Immunol. . 

= 133:3001 (1984); and Brodeur et al. . Monoclonal Antibody Production Techniques and Applications , pp. 51-63 

~;20 (Marcel Dekker, Inc., New York, 1987)). 

1^ Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 

= h antibodies directed against the antigen. Preferably, the binding specificity of monoclonal antibodies produced 

s 7 by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 

radioimmunoassay (RIA) or enzyme-linked immunosorbent assay (ELISA). 
25 The binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard 

analysis described in Munson et al., Anal. Biochem. . 107:220 (1980). 

Once hybridoma cells that produce antibodies of the desired specificity, affinity, and/or activity are 
identified, the clones may be subcloned by limiting dilution procedures and grown by standard methods (Goding, 
Monoclonal Antibodies: Principles and Practice , pp. 59-103 (Academic Press, 1986)). Suitable culture media 
30 for this purpose include, for example, D-MEM or RPMI-1640 medium. In addition, the hybridoma cells may 
be grown in vivo as ascites tumors in an animal e.g,, by i.p. injection of the cells into mice. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture medium, 
ascites fluid, or serum by conventional antibody purification procedures such as, for example, affinity 
chromatography (e.g., using protein A or protein G-Sepharose) or ion-exchange chromatography, 
35 hydroxylapatite chromatography, gel electrophoresis, dialysis, etc. 

DNA encoding the monoclonal antibodies is readily isolated and sequenced using conventional 
procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the 



heavy and light chains of murine antibodies). The hybridoma cells serve as a preferred source of such DNA. 
Once isolated, the DNA may be placed into expression vectors, which are then transfected into host cells such 
as E. coli cells, simian COS cells, Chinese Hamster Ovary (CHO) cells, or myeloma cells that do not otherwise 
produce antibody protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells . Review 
articles on recombinant expression in bacteria of DNA encoding the antibody include Skerra et al., Curr. 
5 Opinion in Immunol. . 5:256-262 (1993) and Pluckthun, Immunol. Revs. 130:151-188 (1992). 

In a further embodiment, monoclonal antibodies or antibody fragments can be isolated from antibody 
phage libraries generated using the techniques described in McCafferty et al., Nature , 348:552-554 (1990). 
Clackson et al., Nature . 352:624-628 (1991) and Marks et al., J. Mol. Biol. . 222:581-597 (1991) describe the 
isolation of murine and human antibodies, respectively, using phage libraries. Subsequent publications describe 
10 the production of high affinity (nM range) human antibodies by chain shuffling (Marks et al., Bio/Technology . 
10:779-783 (1992)), as well as combinatorial infection and in vivo recombination as a strategy for constructing 
very large phage libraries (Waterhouse et al., Nuc. Acids. Res. 21 :2265-2266 (1993)). Thus, these techniques 
are viable alternatives to traditional monoclonal antibody hybridoma techniques for isolation of monoclonal 
= antibodies. 

15 The DNA that encodes the antibody may be modified to produce chimeric or fusion antibody 

polypeptides, for example, by substituting human heavy chain and light chain constant domain (C H and C L ) 
sequences for the homologous murine sequences (U.S. Patent No. 4,816,567; and Morrison, et al., Proc. Natl 
Acad. Sci. USA . 81:6851 (1984)), or by fusing the immunoglobulin coding sequence with all or part of the 
coding sequence for a non-immunoglobulin polypeptide (heterologous polypeptide). The non-immunoglobulin 

-20 polypeptide sequences can substitute for the constant domains of an antibody, or they are substituted for the 
a variable domains of one antigen-combining site of an antibody to create a chimeric bivalent antibody comprising 

one antigen-combining site having specificity for an antigen and another antigen-combining site having specificity 
for a different antigen. 

3. Human and Humanized Antibodies 

25 The anti-TAT antibodies of the invention may further comprise humanized antibodies or human 

antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab 1 , F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 

30 complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In 
some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise 

35 substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are 
those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
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at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones 
et al., Nature . 321:522-525 (1986); Riechmann et al., Nature . 332:323-329 (1988); and Presta, Curr. Op. 
Struct. Biol. . 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
5 human amino acid residues are often referred to as "import" residues, which are typically taken from an "import" 
variable domain. Humanization can be essentially performed following the method of Winter and co-workers 
[Jones et al., Nature . 321:522-525 (1986); Riechmann et al., Nature . 332:323-327 (1988); Verhoeyen et al., 
Science . 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent 
10 No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the 
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human 
antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 
iQ The choice of human variable domains, both light and heavy, to be used in making the humanized 

; ML 5 antibodies is very important to reduce antigenicity and HAMA response (human anti -mouse antibody) when the 
^ antibody is intended for human therapeutic use. According to the so-called "best-fit" method, the sequence of 
; y the variable domain of a rodent antibody is screened against the entire library of known human variable domain 

; ~ sequences. The human V domain sequence which is closest to that of the rodent is identified and the human 

framework region (FR) within it accepted for the humanized antibody (Sims et al., J. Immunol. 151:2296 
} :;20 (1993); Chothia et al., J. Mol. Biol. . 196:901 (1987)). Another method uses a particular framework region 
I ^ derived from the consensus sequence of all human antibodies of a particular subgroup of light or heavy chains. 

; h The same framework may be used for several different humanized antibodies (Carter et al. , Proc. Natl. Acad. 
! J Sci. USA . 89:4285 (1992); Presta et al., J. Immunol. 151:2623 (1993)). 

It is further important that antibodies be humanized with retention of high binding affinity for the antigen 
25 and other favorable biological properties. To achieve this goal, according to a preferred method, humanized 
antibodies are prepared by a process of analysis of the parental sequences and various conceptual humanized 
products using three-dimensional models of the parental and humanized sequences. Three-dimensional 
immunoglobulin models are commonly available and are familiar to those skilled in the art. Computer programs 
are available which illustrate and display probable three-dimensional conformational structures of selected 
30 candidate immunoglobulin sequences. Inspection of these displays permits analysis of the likely role of the 
residues in the functioning of the candidate immunoglobulin sequence, i.e. , the analysis of residues that influence 
the ability of the candidate immunoglobulin to bind its antigen. In this way, FR residues can be selected and 
combined from the recipient and import sequences so that the desired antibody characteristic, such as increased 
affinity for the target antigen(s), is achieved. In general, the hypervariable region residues are directly and most 
35 substantially involved in influencing antigen binding. 

Various forms of a humanized anti-TAT antibody are contemplated. For example, the humanized 
antibody may be an antibody fragment, such as a Fab, which is optionally conjugated with one or more cytotoxic 



agent(s) in order to generate an immunoconjugate. Alternatively, the humanized antibody may be an intact 
antibody, such as an intact IgGl antibody. 

As an alternative to humanization, human antibodies can be generated. For example, it is now possible 
to produce transgenic animals (e.g., mice) that are capable, upon immunization, of producing a full repertoire 
of human antibodies in the absence of endogenous immunoglobulin production. For example, it has been 
5 described that the homozygous deletion of the antibody heavy-chain joining region (J H ) gene in chimeric and 
germ-line mutant mice results in complete inhibition of endogenous antibody production. Transfer of the human 
germ-line immunoglobulin gene array into such germ-line mutant mice will result in the production of human 
antibodies upon antigen challenge. See, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA . 90:2551 (1993); 
Jakobovits et al., Nature . 362:255-258 (1993); Bruggemann et al., Year in Immuno. 7:33 (1993); U.S. Patent 
10 Nos. 5,545,806, 5,569,825, 5,591,669 (all of GenPharm); 5,545,807; and WO 97/17852. 

Alternatively, phage display technology (McCafferty et al., Nature 348:552-553 [1990]) can be used 
to produce human antibodies and antibody fragments in vitro, from immunoglobulin variable (V) domain gene 
repertoires from unimmunized donors. According to this technique, antibody V domain genes are cloned in- 
frame into either a major or minor coat protein gene of a filamentous bacteriophage, such as M13 or fd, and 
- 1 5 displayed as functional antibody fragments on the surface of the phage particle. Because the filamentous particle 
contains a single-stranded DNA copy of the phage genome, selections based on the functional properties of the 
antibody also result in selection of the gene encoding the antibody exhibiting those properties. Thus, the phage 
mimics some of the properties of the B-cell. Phage display can be performed in a variety of formats, reviewed 
in, e.g., Johnson, Kevin S. and Chiswell, David J., Current Opinion in Structural Biology 3:564-571 (1993). 
-20 Several sources of V-gene segments can be used for phage display. Clacksonet al.. Nature . 352:624-628 (1991) 
isolated a diverse array of anti-oxazolone antibodies from a small random combinatorial library of V genes 
derived from the spleens of immunized mice. A repertoire of V genes from unimmunized human donors can 
be constructed and antibodies to a diverse array of antigens (including self-antigens) can be isolated essentially 
following the techniques described by Marks et al. , J. Mol. Biol. 222:581-597 (1991), or Griffith et al. , EMBO 
25 L 12:725-734 (1993). See, also, U.S. Patent Nos. 5,565,332 and 5,573,905. 

As discussed above, human antibodies may also be generated by in vitro activated B cells (see U.S. 
Patents 5,567,610 and 5,229,275). 

4. Antibody fragments 
In certain circumstances there are advantages of using antibody fragments, rather than whole antibodies . 
30 The smaller size of the fragments allows for rapid clearance, and may lead to improved access to solid tumors. 

Various techniques have been developed for the production of antibody fragments. Traditionally, these 
fragments were derived via proteolytic digestion of intact antibodies (see, e.g., Morimoto et al., Journal of 
Biochemical and Biophysical Methods 24:107-117 (1992); and Brennan et al., Science . 229:81 (1985)). 
However, these fragments can now be produced directly by recombinant host cells. Fab, Fv and ScFv antibody 
35 fragments can all be expressed in and secreted from E. coli, thus allowing the facile production of large amounts 
of these fragments. Antibody fragments can be isolated from the antibody phage libraries discussed above. 
Alternatively , Fab ' -SH fragments can be directly recovered from E. coli and chemically coupled to form F(ab ' ) 2 
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fragments (Carter etal., Bio/Technology 10: 163-167 (1992)). According to another approach, F(ab') 2 fragments 
can be isolated directly from recombinant host cell culture. Fab and F(ab') 2 fragment with increased in vivo 
half-life comprising a salvage receptor binding epitope residues are described in U.S. Patent No. 5,869,046. 
Other techniques for the production of antibody fragments will be apparent to the skilled practitioner. In other 
embodiments, the antibody of choice is a single chain Fv fragment (scFv). See WO 93/16185; U.S. Patent No. 
5 5,571 ,894; and U.S. Patent No. 5,587,458. Fv and sFv are the only species with intact combining sites that are 
devoid of constant regions; thus, they are suitable for reduced nonspecific binding during in vivo use. sFv fusion 
proteins may be constructed to yield fusion of an effector protein at either the amino or the carboxy terminus 
of an sFv. See Antibody Engineering , ed. Borrebaeck, supra. The antibody fragment may also be a "linear 
antibody", e.g., as described in U.S. Patent 5,641,870 for example. Such linear antibody fragments may be 
10 monospecific or bispecific. 

5. Bispecific Antibodies 

Bispecific antibodies are antibodies that have binding specificities for at least two different epitopes. 
Exemplary bispecific antibodies may bind to two different epitopes of a TAT protein as described herein. Other 

= such antibodies may combine a TAT binding site with a binding site for another protein. Alternatively, an anti- 

15 TAT arm may be combined with an arm which binds to a triggering molecule on a leukocyte such as a T-cell 
receptor molecule (e.g. CD3), or Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and 
FcyRIII (CD16), so as to focus and localize cellular defense mechanisms to the TAT-expressing cell. Bispecific 
antibodies may also be used to localize cytotoxic agents to cells which express TAT. These antibodies possess 
a TAT-binding arm and an arm which binds the cytotoxic agent (e.g. , saporin, anti-interferon-a, vinca alkaloid, 

=20 ricin A chain, methotrexate or radioactive isotope hapten). Bispecific antibodies can be prepared as full length 
antibodies or antibody fragments (e.g., F(ab')2 bispecific antibodies). 

I WO 96/16673 describes a bispecific anti-ErbB2/anti-FcyRIII antibody and U.S. Patent No. 5,837,234 

~ discloses a bispecific anti-ErbB2/anti-FcyRI antibody. A bispecific anti-ErbB2/Fccc antibody is shown in 

WO98/02463. U.S. Patent No. 5,821,337 teaches a bispecific anti-ErbB2/anti-CD3 antibody. 

25 Methods for making bispecific antibodies are known in the art. Traditional production of full length 

bispecific antibodies is based on the co-expression of two immunoglobulin heavy chain-light chain pairs, where 
the two chains have different specificities (Millstein et al., Nature 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential mixture 
of 10 different antibody molecules, of which only one has the correct bispecific structure. Purification of the 

30 correct molecule, which is usually done by affinity chromatography steps , is rather cumbersome, and the product 
yields are low. Similar procedures are disclosed in WO 93/08829, and in Traunecker et al. , EMBO J. 10:3655- 
3659 (1991). 

According to a different approach, antibody variable domains with the desired binding specificities 
(antibody-antigen combining sites) are fused to immunoglobulin constant domain sequences. Preferably, the 
35 fusion is with an Ig heavy chain constant domain, comprising at least part of the hinge, C H 2, and C H 3 regions. 
It is preferred to have the first heavy-chain constant region (C H 1) containing the site necessary for light chain 
bonding, present in at least one of the fusions. DNAs encoding the immunoglobulin heavy chain fusions and, 
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if desired, the immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected 
into a suitable host cell. This provides for greater flexibility in adjusting the mutual proportions of the three 
polypeptide fragments in embodiments when unequal ratios of the three polypeptide chains used in the 
construction provide the optimum yield of the desired bispecific antibody. It is, however, possible to insert the 
coding sequences for two or all three polypeptide chains into a single expression vector when the expression of 
5 at least two polypeptide chains in equal ratios results in high yields or when the ratios have no significant affect 
on the yield of the desired chain combination. 

In a preferred embodiment of this approach, the bispecific antibodies are composed of a hybrid 
immunoglobulin heavy chain with a first binding specificity in one arm, and a hybrid immunoglobulin heavy 
chain-light chain pair (providing a second binding specificity) in the other arm. It was found that this asymmetric 

10 structure facilitates the separation of the desired bispecific compound from unwanted immunoglobulin chain 
combinations, as the presence of an immunoglobulin light chain in only one half of the bispecific molecule 
provides for a facile way of separation. This approach is disclosed in WO 94/04690. For further details of 
generating bispecific antibodies see, for example, Suresh et al., Methods in Enzvmology 121:210 (1986). 

According to another approach described in U.S. Patent No. 5,731,168, the interface between a pair 

15 of antibody molecules can be engineered to maximize the percentage of heterodimers which are recovered from 
recombinant cell culture. The preferred interface comprises at least a part of the C H 3 domain. In this method, 
one or more small amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g., tyrosine or tryptophan) . Compensatory " cavities " of identical or similar size to the large 
side chain(s) are created on the interface of the second antibody molecule by replacing large amino acid side 

20 chains with smaller ones (e.g., alanine or threonine) . This provides a mechanism for increasing the yield of the 
heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies include cross-linked or "heteroconjugate" antibodies. For example, one of the 
antibodies in the heteroconjugate can be coupled to avidin, the other to biotin. Such antibodies have, for 
example, been proposed to target immune system cells to unwanted cells (U.S. Patent No. 4,676,980), and for 

25 treatment of HIV infection (WO 91/00360, WO 92/200373, and EP 03089). Heteroconjugate antibodies may 
be made using any convenient cross-linking methods. Suitable cross-linking agents are well known in the art, 
and are disclosed in U.S. Patent No. 4,676,980, along with a number of cross-linking techniques. 

Techniques for generating bispecific antibodies from antibody fragments have also been described in 
the literature. For example, bispecific antibodies can be prepared using chemical linkage. Brennan et al., 

30 Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate 
F(ab') 2 fragments. These fragments are reduced in the presence of the dithiol complexing agent, sodium 
arsenite, to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives is then 
reconverted to the Fab '-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of 

35 the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies produced can be used 
as agents for the selective immobilization of enzymes. 
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Recent progress has facilitated the direct recovery of Fab'-SH fragments from E. coli, which can be 
chemically coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175: 217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each Fab' fragment was separately 
secreted from E. coli and subjected to directed chemical coupling in vitro to form the bispecific antibody. The 
bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human 
5 T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly from recombinant 
cell culture have also been described. For example, bispecific antibodies have been produced using leucine 
zippers. Kostelny et al., J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper peptides from the Fos and 
Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody 
10 homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimers. This method can also be utilized for the production of antibody homodimers. The "diabody" 
technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fragments. The fragments comprise a V H connected to 
= a V L by a linker which is too short to allow pairing between the two domains on the same chain. Accordingly, 

15 the V H and V L domains of one fragment are forced to pair with the complementary V L and V H domains of 
another fragment, thereby forming two antigen-binding sites. Another strategy for making bispecific antibody 
fragments by the use of single-chain Fv (sFv) dimers has also been reported. See Gruber et al., J. Immunol. . 
152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be 
=20 prepared. Tutt et al.. J. Immunol. 147:60 (1991). 

6. Heteroconjugate Antibodies 
Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
~ antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed 

to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV infection 
25 [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro 
using known methods in synthetic protein chemistry, including those involving crosslinking agents. For 
example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and 
those disclosed, for example, in U.S. Patent No. 4,676,980. 
30 7. Multivalent Antibodies 

A multivalent antibody may be internalized (and/or catabolized) faster than a bivalent antibody by a cell 
expressing an antigen to which the antibodies bind. The antibodies of the present invention can be multivalent 
antibodies (which are other than of the IgM class) with three or more antigen binding sites (e.g. tetravalent 
antibodies), which can be readily produced by recombinant expression of nucleic acid encoding the polypeptide 
35 chains of the antibody . The multivalent antibody can comprise a dimerization domain and three or more antigen 
binding sites. The preferred dimerization domain comprises (or consists of) an Fc region or a hinge region. In 
this scenario, the antibody will comprise an Fc region and three or more antigen binding sites amino-terminal 



to the Fc region. The preferred multivalent antibody herein comprises (or consists of) three to about eight, but 
preferably four, antigen binding sites. The multivalent antibody comprises at least one polypeptide chain (and 
preferably two polypeptide chains), wherein the polypeptide chain(s) comprise two or more variable domains. 
For instance, the polypeptide chain(s) may comprise VDl-(Xl) n -VD2-(X2) n -Fc, wherein VD1 is a first variable 
domain, VD2 is a second variable domain, Fc is one polypeptide chain of an Fc region, XI and X2 represent 
5 an amino acid or polypeptide, and n is 0 or 1 . For instance, the polypeptide chain(s) may comprise: VH-CH1- 
flexible linker-VH-CHl-Fc region chain; or VH-CHl-VH-CHl-Fc region chain. The multivalent antibody herein 
preferably further comprises at least two (and preferably four) light chain variable domain polypeptides. The 
multivalent antibody herein may, for instance, comprise from about two to about eight light chain variable 
domain polypeptides. The light chain variable domain polypeptides contemplated here comprise a light chain 
10 variable domain and, optionally, further comprise a CL domain. 

8. Effector Function Engineering 

It may be desirable to modify the antibody of the invention with respect to effector function, e.g., so 
as to enhance antigen-dependent cell-mediated cyotoxicity (ADCC) and/or complement dependent cytotoxicity 
~ (CDC) of the antibody. This may be achieved by introducing one or more amino acid substitutions in an Fc 

15 region of the antibody. Alternatively or additionally, cysteine residue(s) may be introduced in the Fc region, 
thereby allowing interchain disulfide bond formation in this region. The homodimeric antibody thus generated 
may have improved internalization capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et al. , J. Exp Med. 176: 1 191-1 195 (1992) and Shopes, B. 
J. Immunol. 148:2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity may also be 
20 prepared using heterobifunctional cross-linkers as described in Wolff et al., Cancer Research 53:2560-2565 
(1993). Alternatively, an antibody can be engineered which has dual Fc regions and may thereby have enhanced 
- complement lysis and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design 3:219-230 (1989). 

: * To increase the serum half life of the antibody, one may incorporate a salvage receptor binding epitope 

into the antibody (especially an antibody fragment) as described in U.S. Patent 5,739,277, for example. As used 
25 herein, the term "salvage receptor binding epitope" refers to an epitope of the Fc region of an IgG molecule 
(e.g., IgG 1; IgG 2 , IgG 3 , or IgG 4 ) that is responsible for increasing the in vivo serum half-life of the IgG 
molecule. 

9. Irnmunoconjugates 

The invention also pertains to irnmunoconjugates comprising an antibody conjugated to a cytotoxic agent 
30 such as a chemotherapeutic agent, a growth inhibitory agent, a toxin {e.g. , an enzymatically active toxin of 
bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., aradioconjugate). 

Chemotherapeutic agents useful in the generation of such irnmunoconjugates have been described above. 
Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active 
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 
35 modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 
(PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, 
gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are 



available for the production of radioconjugated antibodies. Examples include 212 Bi, I31 I, 131 In, ^Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling 
agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional 
derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), 
aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 
5 diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine) , diisocyanates (such as tolyene 2,6- 
diisocyanate), and bis-active fluorine compounds (such as 1 ,5-difluoro-2,4-dinitrobenzene) . For example, a ricin 
immunotoxin can be prepared as described in Vitetta et al. , Science . 238 : 1098 (1987). Carbon- 14-labeled 1- 
isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent 
for conjugation of radionucleotide to the antibody. See W094/1 1026. 
10 Conjugates of an antibody and one or more small molecule toxins, such as a calicheamicin, 

maytansinoids, a trichothene, and CC1065, and the derivatives of these toxins that have toxin activity, are also 
contemplated herein. 
Maytansine and maytansinoids 

= In one preferred embodiment, an anti-TAT antibody (full length or fragments) of the invention is 

15 conjugated to one or more maytansinoid molecules. 

Maytansinoids are mitototic inhibitors which act by inhibiting tubulin polymerization. Maytansine was 
first isolated from the east African shrub May terms serrata (U.S. Patent No. 3,896,1 1 1). Subsequently, it was 
discovered that certain microbes also produce maytansinoids, such as maytansinol and C-3 maytansinol esters 
(U.S. Patent No. 4,151,042). Synthetic maytansinol and derivatives and analogues thereof are disclosed, for 

■20 example, in U.S. Patent Nos. 4,137,230; 4,248,870; 4,256,746; 4,260,608; 4,265,814; 4,294,757; 4,307,016; 
4,308,268; 4,308,269; 4,309,428; 4,313,946; 4,315,929; 4,317,821; 4,322,348; 4,331,598; 4,361,650; 

1 4,364,866; 4,424,219; 4,450,254; 4,362,663; and 4,371,533, the disclosures of which are hereby expressly 

incorporated by reference. 
Mavtansinoid-antibodv conjugates 

25 In an attempt to improve their therapeutic index, maytansine and maytansinoids have been conjugated 

to antibodies specifically binding to tumor cell antigens. Immunoconjugates containing maytansinoids and their 
therapeutic use are disclosed, for example, in U.S. Patent Nos. 5,208,020, 5,416,064 and European Patent EP 
0 425 235 Bl, the disclosures of which are hereby expressly incorporated by reference. Liu et al., Proc. Natl. 
Acad. Sci. USA 93:8618-8623 (1996) described immunoconjugates comprising a maytansinoid designated DM1 

30 linked to the monoclonal antibody C242 directed against human colorectal cancer. The conjugate was found to 
be highly cytotoxic towards cultured colon cancer cells, and showed antitumor activity in an in vivo tumor 
growth assay. Chari et al., Cancer Research 52:127-131 (1992) describe immunoconjugates in which a 
maytansinoid was conjugated via a disulfide linker to the murine antibody A7 binding to an antigen on human 
colon cancer cell lines, or to another murine monoclonal antibody TA.l that binds the HER-2/«e« oncogene. 

35 The cytotoxicity of the TA. 1-maytansonoid conjugate was tested in vitro on the human breast cancer cell line 
SK-BR-3, which expresses 3 x 10 5 HER-2 surface antigens per cell. The drug conjugate achieved a degree of 
cytotoxicity similar to the free maytansonid drug, which could be increased by increasing the number of 



maytansinoid molecules per antibody molecule. The A7-maytansinoid conjugate showed low systemic 
cytotoxicity in mice. 

Anti-TAT polypeptide antibody-maytansinoid conjugates (immunoconjugates) 

Anti-TAT antibody-maytansinoid conjugates are prepared by chemically linking an anti-TAT antibody 
to a maytansinoid molecule without significantly diminishing the biological activity of either the antibody or the 
5 maytansinoid molecule . An average of 3 -4 maytansinoid molecules conjugated per antibody molecule has shown 
efficacy in enhancing cytotoxicity of target cells without negatively affecting the function or solubility of the 
antibody, although even one molecule of toxin/antibody would be expected to enhance cytotoxicity over the use 
of naked antibody. Maytansinoids are well known in the art and can be synthesized by known techniques or 
isolated from natural sources. Suitable maytansinoids are disclosed, for example, in U.S. Patent No. 5,208,020 
10 and in the other patents and nonpatent publications referred to hereinabove. Preferred maytansinoids are 
maytansinol and maytansinol analogues modified in the aromatic ring or at other positions of the maytansinol 
molecule, such as various maytansinol esters. 

There are many linking groups known in the art for making antibody-maytansinoid conjugates, 
g including, for example, those disclosed in U.S. Patent No. 5,208,020 or EP Patent 0 425 235 Bl, and Chari et 
ft 5 al., Cancer Research 52:127-131 (1992). The linking groups include disufide groups, fhioether groups, acid 
f± labile groups, photolabile groups, peptidase labile groups, or esterase labile groups, as disclosed in the above- 

y identified patents, disulfide and thioether groups being preferred. 

Conjugates of the antibody and maytansinoid may be made using a variety of bifunctional protein 
coupling agents such as N-succinimidyl-3-(2-pyridyldithio) propionate (SPDP), succinimidyl-4-(N- 
*£0 maleimidomethyl) cyclohexane-l-carboxylate, iminothiolane (IT), bifunctional derivatives of imidoesters (such 
a! L as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
jj= glutareldehyde) , bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine) , bis-diazonium derivatives 
^ (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as toluene 2,6-diisocyanate), and bis- 

active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). Particularly preferred coupling agents 
25 include N-succinimidyl-3-(2-pyridyldithio) propionate (SPDP) (Carlsson et al. , Biochem. J. 173 :723-737 [1978]) 
and N-succinimidyl-4-(2-pyridylthio)pentanoate (SPP) to provide for a disulfide linkage. 

The linker may be attached to the maytansinoid molecule at various positions, depending on the type 
of the link. For example, an ester linkage may be formed by reaction with a hydroxyl group using conventional 
coupling techniques. The reaction may occur at the C-3 position having a hydroxyl group, the C-14 position 
30 modified with hyrdoxy methyl, the C-15 position modified with a hydroxyl group, and the C-20 position having 
a hydroxyl group. In a preferred embodiment, the linkage is formed at the C-3 position of maytansinol or a 
maytansinol analogue. 
Calicheamicin 

Another immunoconjugate of interest comprises an anti-TAT antibody conjugated to one or more 
35 calicheamicin molecules . The calicheamicin family of antibiotics are capable of producing double-stranded DNA 
breaks at sub-picomolar concentrations. For the preparation of conjugates of the calicheamicin family, see U.S. 
patents 5,712,374, 5,714,586, 5,739,116, 5,767,285, 5,770,701, 5,770,710, 5,773,001, 5,877,296 (all to 
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American Cyanamid Company). Structural analogues of calicheamicin which may be used include, but are not 
limited to, y/, aj, a-$, N-acetyl-y/, PSAG and Q l l (Hinman et al., Cancer Research 53:3336-3342 (1993), 
Lode et al., Cancer Research 58:2925-2928 (1998) and the aforementioned U.S. patents to American Cyanamid). 
Another anti-tumor drug that the antibody can be conjugated is QFA which is an antifolate. Both calicheamicin 
and QFA have intracellular sites of action and do not readily cross the plasma membrane. Therefore, cellular 
5 uptake of these agents through antibody mediated internalization greatly enhances their cytotoxic effects. 
Other cytotoxic agents 

Other antitumor agents that can be conjugated to the anti-TAT antibodies of the invention include 
BCNU, streptozoicin, vincristine and 5-fluorouracil, the family of agents known collectively LL-E33288 
complex described in U.S. patents 5,053,394, 5,770,710, as well as esperamicins (U.S. patent 5,877,296). 
10 Enzymatically active toxins and fragments thereof which can be used include diphtheria A chain, 

nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca 
._ americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 

^ officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin and the tricothecenes. See, for 

15 example, WO 93/21232 published October 28, 1993. 

The present invention further contemplates an immunoconjugate formed between an antibody and a 
compound with nucleolytic activity (e.g., a ribonuclease or a DNA endonuclease such as a deoxyribonuclease; 
DNase). 

For selective destruction of the tumor, the antibody may comprise a highly radioactive atom. A variety 
"20 of radioactive isotopes are available for the production of radioconjugated anti-TAT antibodies. Examples 
include At 211 , I 131 , I 125 , Y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 , Pb 212 and radioactive isotopes of Lu. When the 
conjugate is used for diagnosis, it may comprise a radioactive atom for scintigraphic studies, for example tc 99m 
or I 123 , or a spin label for nuclear magnetic resonance (NMR) imaging (also known as magnetic resonance 
imaging, mri), such as iodine-123 again, iodine-131, indium-Ill, fluorine-19, carbon-13, nitrogen-15, oxygen- 
25 17, gadolinium, manganese or iron. 

The radio- or other labels may be incorporated in the conjugate in known ways. For example, the 
peptide may bebiosynthesized or may be synthesized by chemical amino acid synthesis using suitable amino acid 
precursors involving, for example, fluorine-19 in place of hydrogen. Labels such as tc 99m or I 123 , .Re 186 , Re 188 
and In 111 can be attached via a cysteine residue in the peptide. Yttrium-90 can be attached via a lysine residue. 
30 The IODOGEN method (Fraker et al (1978) Biochem. Biophys. Res. Commun. 80: 49-57 can be used to 
incorporate iodine-123. "Monoclonal Antibodies in Immunoscintigraphy" (Chatal,CRC Press 1989) describes 
other methods in detail. 

Conjugates of the antibody and cytotoxic agent may be made using a variety of bifunctional protein 
coupling agents such as N-succinimidyl-3-(2-pyridyldithio) propionate (SPDP), succinimidyl-4-(N- 
35 maleimidomethyl) cyclohexane-l-carboxylate, iminothiolane (IT), bifunctional derivatives of imidoesters (such 
as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
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glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives 
(such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis- 
active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can 
be prepared as described in Vitetta et al., Science 238: 1098 (1987). Carbon- 14-labeled 1-isothiocyanatobenzyl- 
3-methyldiethylene triaminepentaacetic acid (MX- DTP A) is an exemplary chelating agent for conjugation of 
5 radionucleotide to the antibody. See W094/1 1026. The linker may be a "cleavable linker" facilitating release 
of the cytotoxic drug in the cell. For example, an acid-labile linker, peptidase-sensitive linker, photolabile 
linker, dimethyl linker or disulfide-containing linker (Chari et al., Cancer Research 52:127-131 (1992); U.S. 
Patent No. 5,208,020) may be used. 

Alternatively, a fusion protein comprising the anti-TAT antibody and cytotoxic agent may be made, 
10 e.g., by recombinant techniques or peptide synthesis. The length of DNA may comprise respective regions 
encoding the two portions of the conjugate either adjacent one another or separated by a region encoding a linker 
peptide which does not destroy the desired properties of the conjugate. 

In yet another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in tumor pre-targeting wherein the antibody-receptor conjugate is administered to the patient, followed 
15 by removal of unbound conjugate from the circulation using a clearing agent and then administration of a 
"ligand" (e.g., avidin) which is conjugated to a cytotoxic agent (e.g., a radionucleotide). 
10. Immunoliposomes 
The anti-TAT antibodies disclosed herein may also be formulated as immunoliposomes. A "liposome" 
is a small vesicle composed of various types of lipids, phospholipids and/or surfactant which is useful for 
;20 delivery of a drug to a mammal. The components of the liposome are commonly arranged in a bilayer 
,. formation, similar to the lipid arrangement of biological membranes. Liposomes containing the antibody are 

* prepared by methods known in the art, such as described in Epstein et al. , Proc. Natl. Acad. Sci. USA 82:3688 

(1985); Hwang et al., Proc. Natl Acad. Sci. USA 77:4030 (1980); U.S. Pat. Nos. 4,485,045 and 4,544,545; 
and W097/38731 published October 23, 1997. Liposomes with enhanced circulation time are disclosed in U.S. 
25 Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol and PEG-derivatized phosphatidylethanolamine (PEG- 
PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. 
Fab ' fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin 
30 et al., J. Biol. Chem. 257:286-288 (1982) via a disulfide interchange reaction. A chemotherapeutic agent is 
optionally contained within the liposome. See Gabizon et al., J. National Cancer Inst. 81(19):1484 (1989). 
B. TAT Binding Oligopeptides 

TAT binding oligopeptides of the present invention are oligopeptides that bind, preferably specifically, 
to a TAT polypeptide as described herein. TAT binding oligopeptides may be chemically synthesized using 
35 known oligopeptide synthesis methodology or may be prepared and purified using recombinant technology. TAT 
binding oligopeptides are usually at least about 5 amino acids in length, alternatively at least about 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 
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38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 
66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99, or 100 amino acids in length or more, wherein such oligopeptides that are capable of 
binding, preferably specifically, to a TAT polypeptide as described herein. TAT binding oligopeptides may be 
identified without undue experimentation using well known techniques . In this regard, it is noted that techniques 
5 for screening oligopeptide libraries for oligopeptides that are capable of specifically binding to a polypeptide 
target are well known in the art (see, e.g., U.S. Patent Nos. 5,556,762, 5,750,373, 4,708,871, 4,833,092, 
5,223,409, 5,403,484, 5,571,689, 5,663,143; PCT Publication Nos. WO 84/03506 and WO84/03564; Geysen 
et al., Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002 (1984); Geysen et al., Proc. Natl. Acad. Sci. U.S.A., 
82: 178-182 (1985); Geysen et al., in Synthetic Peptides as Antigens, 130-149 (1986); Geysen et al. , J. Immunol. 
10 Meth., 102:259-274 (1987); Schoofs et al., J. Immunol., 140:611-616 (1988), Cwirla, S. E. et al. (1990) Proc. 
Natl. Acad. Sci. USA, 87:6378; Lowman, H.B. et al. (1991) Biochemistry, 30:10832; Clackson, T. et al. 
(1991) Nature, 352: 624; Marks, J. D. et al. (1991), J. Mol. Biol., 222:581; Kang, A.S. et al. (1991) Proc. 
Natl. Acad. Sci. USA, 88:8363, and Smith, G. P. (1991) Current Opin. Biotechnol., 2:668). 
; p In this regard, bacteriophage (phage) display is one well known technique which allows one to screen 

] -1 5 large oligopeptide libraries to identify member(s) of those libraries which are capable of specifically binding to 
; fl a polypeptide target. Phage display is a technique by which variant polypeptides are displayed as fusion proteins 

'""4 to the coat protein on the surface of bacteriophage particles (Scott, J.K. and Smith, G. P. (1990) Science 249: 

'jk 386). The utility of phage display lies in the fact that large libraries of selectively randomized protein variants 
(or randomly cloned cDNAs) can be rapidly and efficiently sorted for those sequences that bind to a target 
=20 molecule with high affinity. Display of peptide (Cwirla, S. E. et al. (1990) Proc. Natl. Acad. Sci. USA, 
jl 87:6378) or protein (Lowman, H.B. et al. (1991) Biochemistry, 30:10832; Clackson, T. et al. (1991) Nature, 

;P 352: 624; Marks, J. D. et al. (1991), J. Mol. Biol., 222:581; Kang, A.S. et al. (1991) Proc. Natl. Acad. Sci. 

i J USA, 88:8363) libraries on phage have been used for screening millions of polypeptides or oligopeptides for 

ones with specific binding properties (Smith, G. P. (1991) Current Opin. Biotechnol., 2:668). Sorting phage 
25 libraries of random mutants requires a strategy for constructing and propagating a large number of variants, a 
procedure for affinity purification using the target receptor, and a means of evaluating the results of binding 
enrichments. U.S. Patent Nos. 5,223,409, 5,403,484, 5,571,689, and 5,663,143. 

Although most phage display methods have used filamentous phage, lambdoid phage display systems 
(WO 95/34683; U.S. 5,627,024), T4 phage display systems (Ren, Z-J. et al. (1998) Gene 215:439; Zhu, Z. 
30 (1997) CAN 33:534; Jiang, J. et al. (1997) can 128:44380; Ren, Z-J. et al. (1997) CAN 127:215644; Ren, Z-J. 
(1996) Protein Sci. 5:1833; Efimov, V. P. et al. (1995) Virus Genes 10:173) and T7 phage display systems 
(Smith, G. P. and Scott, J.K. (1993) Methods in Enzymology, 217, 228-257; U.S. 5,766,905) are also known. 

Many other improvements and variations of the basic phage display concept have now been developed. 
These improvements enhance the ability of display systems to screen peptide libraries for binding to selected 
35 target molecules and to display functional proteins with the potential of screening these proteins for desired 
properties. Combinatorial reaction devices for phage display reactions have been developed (WO 98/14277) and 
phage display libraries have been used to analyze and control bimolecular interactions (WO 98/20169; WO 



98/20159) and properties of constrained helical peptides (WO 98/20036). WO 97/35 196 describes a method of 
isolating an affinity ligand in which a phage display library is contacted with one solution in which the ligand 
will bind to a target molecule and a second solution in which the affinity ligand will not bind to the target 
molecule, to selectively isolate binding ligands. WO 97/46251 describes a method of biopanning a random 
phage display library with an affinity purified antibody and then isolating binding phage, followed by a 
5 micropanning process using microplate wells to isolate high affinity binding phage. The use of Staphylococcus 
aureus protein A as an affinity tag has also been reported (Li et al. (1998) Mol Biotech., 9:187). W0 97/47314 
describes the use of substrate subtraction libraries to distinguish enzyme specificities using a combinatorial 
library which may be a phage display library. A method for selecting enzymes suitable for use in detergents 
using phage display is described in WO 97/09446. Additional methods of selecting specific binding proteins are 
10 described in U.S. Patent Nos. 5,498,538, 5,432,018, and WO 98/15833. 

Methods of generating peptide libraries and screening these libraries are also disclosed in U.S. Patent 
Nos. 5,723,286, 5,432,018, 5,580,717, 5,427,908, 5,498,530, 5,770,434, 5,734,018, 5,698,426, 5,763,192, 
and 5,723,323. 

C. TAT Binding Organic Molecules 

i 5 TAT binding organic molecules are organic molecules other than oligopeptides or antibodies as defined 

herein that bind, preferably specifically, to a TAT polypeptide as described herein. TAT binding organic 
molecules may be identified and chemically synthesized using known methodology (see, e.g., PCT Publication 
Nos. WO00/00823 and WO00/39585). TAT binding organic molecules are usually less than about 2000 daltons 
in size, alternatively less than about 1500, 750, 500, 250 or 200 daltons in size, wherein such organic molecules 

-20 that are capable of binding, preferably specifically, to a TAT polypeptide as described herein may be identified 
without undue experimentation using well known techniques. In this regard, it is noted that techniques for 

* screening organic molecule libraries for molecules that are capable of binding to a polypeptide target are well 

~ known in the art (see, e.g., PCT Publication Nos. WO00/00823 and WO00/39585). TAT binding organic 

molecules maybe, for example, aldehydes, ketones, oximes, hydrazones, semicarbazones, carbazides, primary 

25 amines, secondary amines, tertiary amines, N-substituted hydrazines, hydrazides, alcohols, ethers, thiols, 
thioethers, disulfides, carboxylic acids, esters, amides, ureas, carbamates, carbonates, ketals, thioketals, acetals, 
thioacetals, aryl halides, aryl sulfonates, alkyl halides, alkyl sulfonates, aromatic compounds, heterocyclic 
compounds, anilines, alkenes, alkynes, diols, amino alcohols, oxazolidines, oxazolines, thiazolidines, thiazolines, 
enamines, sulfonamides, epoxides, aziridines, isocyanates, sulfonyl chlorides, diazo compounds, acid chlorides, 

30 or the like. 

D. Screening for Anti-TAT Antibodies. TAT Binding Oligopeptides and TAT Binding Organic 
Molecules With the Desired Properties 

Techniques for generating antibodies, oligopeptides and organic molecules that bind to TAT 
polypeptides have been described above. One may further select antibodies, oligopeptides or other organic 
35 molecules with certain biological characteristics, as desired. 

The growth inhibitory effects of an anti-TAT antibody, oligopeptide or other organic molecule of the 
invention may be assessed by methods known in the art, e.g. , using cells which express a TAT polypeptide either 



endogenously or following transfection with the TAT gene. For example, appropriate tumor cell lines and TAT- 
transfected cells may treated with an anti-TAT monoclonal antibody, oligopeptide or other organic molecule of 
the invention at various concentrations for a few days (e.g., 2-7) days and stained with crystal violet or MTT 
or analyzed by some other colorimetric assay. Another method of measuring proliferation would be by 
comparing 3 H-fhymidine uptake by the cells treated in the presence or absence an anti-TAT antibody, TAT 
5 binding oligopeptide or TAT binding organic molecule of the invention. After treatment, the cells are harvested 
and the amount of radioactivity incorporated into the DNA quantitated in a scintillation counter. Appropriate 
positive controls include treatment of a selected cell line with a growth inhibitory antibody known to inhibit 
growth of that cell line. Growth inhibition of tumor cells in vivo can be determined in various ways known in 
the art. Preferably, the tumor cell is one that overexpresses a TAT polypeptide. Preferably, the anti-TAT 

10 antibody, TAT binding oligopeptide or TAT binding organic molecule will inhibit cell proliferation of a TAT- 
expressing tumor cell in vitro or in vivo by about 25-100% compared to the untreated tumor cell, more 
preferably, by about 30-100%, and even more preferably by about 50-100% or 70-100%, in one embodiment, 
at an antibody concentration of about 0.5 to 30 /xg/ml. Growth inhibition can be measured at an antibody 
concentration of about 0.5 to 30 fig/ml or about 0.5 nM to 200 nM in cell culture, where the growth inhibition 

15 is determined 1-10 days after exposure of the tumor cells to the antibody. The antibody is growth inhibitory in 
vivo if administration of the anti-TAT antibody at about 1 /*g/kg to about 100 mg/kg body weight results in 
reduction in tumor size or reduction of tumor cell proliferation within about 5 days to 3 months from the first 
administration of the antibody, preferably within about 5 to 30 days. 

To select for an anti-TAT antibody, TAT binding oligopeptide or TAT binding organic molecule which 

20 induces cell death, loss of membrane integrity as indicated by, e.g. , propidium iodide (PI), trypan blue or 7AAD 
uptake may be assessed relative to control. A PI uptake assay can be performed in the absence of complement 
and immune effector cells . TAT polypeptide-expressing tumor cells are incubated with medium alone or medium 
containing the appropriate anti-TAT antibody (e.g, at about 10/x.g/ml), TAT binding oligopeptide or TAT binding 
organic molecule. The cells are incubated for a 3 day time period. Following each treatment, cells are washed 

25 and aliquoted into 35 mm strainer-capped 12 x 75 tubes (1ml per tube, 3 tubes per treatment group) for removal 
of cell clumps. Tubes then receive PKlO^g/ml). Samplesmaybe analyzed using a FACSC AN® flow cytometer 
and FACSCONVERT® CellQuest software (Becton Dickinson). Those anti-TAT antibodies, TAT binding 
oligopeptides or TAT binding organic molecules that induce statistically significant levels of cell death as 
determined by PI uptake may be selected as cell death-inducing anti-TAT antibodies, TAT binding oligopeptides 

30 or TAT binding organic molecules. 

To screen for antibodies, oligopeptides or other organic molecules which bind to an epitope on a TAT 
polypeptide bound by an antibody of interest, a routine cross-blocking assay such as that described in Antibodies. 
A Laboratory Manual . Cold Spring Harbor Laboratory, Ed Harlow and David Lane (1988), can be performed. 
This assay can be used to determine if a test antibody, oligopeptide or other organic molecule binds the same 

35 site or epitope as a known anti-TAT antibody. Alternatively, or additionally, epitope mapping can be performed 
by methods known in the art . For example, the antibody sequence can be mutagenized such as by alanine 
scanning, to identify contact residues. The mutant antibody is initailly tested for binding with polyclonal 



antibody to ensure proper folding. In a different method, peptides corresponding to different regions of a TAT 
polypeptide can be used in competition assays with the test antibodies or with a test antibody and an antibody 
with a characterized or known epitope. 

E. Antibody Dependent Enzyme Mediated Prodrug Therapy (ADEPT) 

The antibodies of the present invention may also be used in ADEPT by conjugating the antibody to a 
5 prodrug-activating enzyme which converts a prodrug (e.g., apeptidyl chemotherapeutic agent, see W08 1/01 145) 
to an active anti-cancer drug. See, for example, WO 88/07378 and U.S. Patent No. 4,975,278. 

The enzyme component of the immunoconjugate useful for ADEPT includes any enzyme capable of 
acting on a prodrug in such a way so as to covert it into its more active, cytotoxic form. 

Enzymes that are useful in the method of this invention include, but are not limited to, alkaline 
10 phosphatase useful for converting phosphate-containing prodrugs into free drugs; arylsulfatase useful for 
converting sulfate-containing prodrugs into free drugs; cytosine deaminase useful for converting non-toxic 5- 
fluorocytosine into the anti-cancer drug, 5-fluorouracil; proteases, such as serratia protease, thermolysin, 
subtilisin, carboxypeptidases and cathepsins (such as cathepsins B and L), that are useful for converting peptide- 

— containing prodrugs into free drugs; D-alanylcarboxypeptidases, useful for converting prodrugs that contain D- 
15 amino acid substituents; carbohydrate-cleaving enzymes such as |3-galactosidase and neuraminidase useful for 

converting glycosylated prodrugs into free drugs; P -lactamase useful for converting drugs derivatized with 0- 
Iactams into free drugs; and penicillin amidases, such as penicillin V amidase or penicillin G amidase, useful 
for converting drugs derivatized at their amine nitrogens with phenoxyacetyl or phenylacetyl groups, 
respectively, into free drugs. Alternatively, antibodies with enzymatic activity, also known in the art as 
.20 "abzymes", can be used to convert the prodrugs of the invention into free active drugs (see, e.g., Massey, 
Nature 328:457-458 (1987)). Antibody-abzyme conjugates can be prepared as described herein for delivery of 

- the abzyme to a tumor cell population. 

* The enzymes of this invention can be covalently bound to the anti-TAT antibodies by techniques well 

known in the art such as the use of the heterobifunctional crosslinking reagents discussed above. Alternatively, 

25 fusion proteins comprising at least the antigen binding region of an antibody of the invention linked to at least 
a functionally active portion of an enzyme of the invention can be constructed using recombinant DNA 
techniques well known in the art (see, e.g., Neuberger et al., Nature 312:604-608 (1984). 

F. Full-Length TAT Polypeptides 

The present invention also provides newly identified and isolated nucleotide sequences encoding 
30 polypeptides referred to in the present application as TAT polypeptides. In particular, cDNAs (partial and full- 
length) encoding various TAT polypeptides have been identified and isolated, as disclosed in further detail in 
the Examples below. 

As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 
actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of the 
35 deposited clone using routine methods in the art. The predicted amino acid sequence can be determined from 
the nucleotide sequence using routine skill. For the TAT polypeptides and encoding nucleic acids described 
herein, in some cases, Applicants have identified what is believed to be the reading frame best identifiable with 



the sequence information available at the time. 

G. Anti-TAT Antibody and TAT Polypeptide Variants 

In addition to the anti-TAT antibodies and full-length native sequence TAT polypeptides described 
herein, it is contemplated that anti-TAT antibody and TAT polypeptide variants can be prepared. Anti-TAT 
antibody and TAT polypeptide variants can be prepared by introducing appropriate nucleotide changes into the 
5 encoding DNA, and/or by synthesis of the desired antibody or polypeptide. Those skilled in the art will 
appreciate that amino acid changes may alter post-translational processes of the anti-TAT antibody or TAT 
polypeptide, such as changing the number or position of glycosylation sites or altering the membrane anchoring 
characteristics. 

Variations in the anti-TAT antibodies and TAT polypeptides described herein, can be made, for 
10 example, using any of the techniques and guidelines for conservative and non-conservative mutations set forth, 
for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion of one or 
more codons encoding the antibody or polypeptide that results in a change in the amino acid sequence as 
compared with the native sequence antibody or polypeptide. Optionally the variation is by substitution of at least 
;Q one amino acid with any other amino acid in one or more of the domains of the anti-TAT antibody or TAT 
] Ml 5 polypeptide. Guidance in determining which amino acid residue may be inserted, substituted or deleted without 
;Q adversely affecting the desired activity may be found by comparing the sequence of the anti-TAT antibody or 
'"■4 TAT polypeptide with that of homologous known protein molecules and minimizing the number of amino acid 
sequence changes made in regions of high homology. Amino acid substitutions can be the result of replacing 
one amino acid with another amino acid having similar structural and/or chemical properties, such as the 
;;*£0 replacement of a leucine with a serine, i.e., conservative amino acid replacements. Insertions or deletions may 
M= optionally be in the range of about 1 to 5 amino acids. The variation allowed may be determined by 

5 |f systematically making insertions, deletions or substitutions of amino acids in the sequence and testing the 

j ^ resulting variants for activity exhibited by the full-length or mature native sequence. 

Anti-TAT antibody and TAT polypeptide fragments are provided herein. Such fragments may be 
25 truncated at the N-terminus or C-terminus, or may lack internal residues, for example, when compared with a 
full length native antibody or protein. Certain fragments lack amino acid residues that are not essential for a 
desired biological activity of the anti-TAT antibody or TAT polypeptide. 

Anti-TAT antibody and TAT polypeptide fragments may be prepared by any of a number of 
conventional techniques. Desired peptide fragments may be chemically synthesized. An alternative approach 
30 involves generating antibody or polypeptide fragments by enzymatic digestion, e . g . , by treating the protein with 
an enzyme known to cleave proteins at sites defined by particular amino acid residues, or by digesting the DNA 
with suitable restriction enzymes and isolating the desired fragment. Yet another suitable technique involves 
isolating and amplifying a DNA fragment encoding a desired antibody or polypeptide fragment, by polymerase 
chain reaction (PCR). Oligonucleotides that define the desired termini of the DNA fragment are employed at 
35 the 5 ' and 3 ' primers in the PCR. Preferably, anti-TAT antibody and TAT polypeptide fragments share at least 
one biological and/or immunological activity with the native anti-TAT antibody or TAT polypeptide disclosed 
herein. 
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In particular embodiments, conservative substitutions of interest are shown in Table 6 under the heading 
of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial 
changes, denominated exemplary substitutions in Table 6, or as further described below in reference to amino 
acid classes, are introduced and the products screened. 

Table 6 



. 5 


Original 


Exemplary 


Preferred 




Residue 


Substitutions 


Substitutions 






val; leu; ile 


val 




A m 

Arg (R) 


lys; gin; asn 


lys 


1U 


Asn (N) 


gin; his; lys; arg 


gin 




Asp (D) 


glu 


glu 




Cys (C) 


ser 


ser 




Gin (Q) 


asn 


asn 




Glu (E) 


asp 


asp 


15 


Gly (G) 


pro; ala 


ala 




His (H) 


asn; gin; lys; arg 


arg 




He (I) 


leu; val; met; ala; phe; 








norleucine 


leu 




Leu (L) 


norleucine; ile; val; 




<20 




met; ala; phe 


ile 




Lys (K) 


arg; gin; asn 


arg 




Met (M) 


leu; phe; ile 


leu 




Phe (F) 


leu; val; ile; ala; tyr 


leu 




Pro (P) 


ala 


ala 


•i;25 


Ser (S) 


thr 


thr 




Thr (T) 


ser 


ser 




Trp(W) 


tyr; phe 


tyr 




Tyr (Y) 


trp; phe; thr; ser 


phe 




Val (V) 


ile; leu; met; phe; 




#0 




ala; norleucine 


leu 



* Substantial modifications in function or immunological identity of the anti-TAT antibody or TAT 

polypeptide are accomplished by selecting substitutions that differ significantly in their effect on maintaining (a) 
the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 

35 conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. 
Naturally occurring residues are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

40 (4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another class. 
Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into 
45 the remaining (non-conserved) sites. 
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The variations can be made using methods known in the art such as oligonucleotide-mediated (site- 
directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al., Nucl. 
Acids Res . . 13:4331 (1986); Zoller et al., Nucl. Acids Res. . 10:6487 (1987)], cassette mutagenesis [Wells et 
al., Gene , 34:315 (1985)], restriction selection mutagenesis [Wells et al,, Philos. Trans. R. Soc. London SerA . 
3_17:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the anti-TAT 
5 antibody or TAT polypeptide variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such 
amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid 
among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main- 
10 chain conformation of the variant [Cunningham and Wells, Science . 244:1081-1085 (1989)]. Alanine is also 
typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and 
exposed positions [Creighton, The Proteins . (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol. . 150 :1 
(1976)]. If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 
~_ Any cysteine residue not involved in maintaining the proper conformation of the anti-TAT antibody or 

=15 TAT polypeptide also may be substituted, generally with serine, to improve the oxidative stability of the 
molecule and prevent aberrant crosslinking. Conversely, cysteine bond(s) may be added to the anti-TAT 
antibody or TAT polypeptide to improve its stability (particularly where the antibody is an antibody fragment 
such as an Fv fragment). 

A particularly preferred type of substitutional variant involves substituting one or more hypervariable 

=20 region residues of a parent antibody (e.g., a humanized or human antibody). Generally, the resulting variant(s) 
selected for further development will have improved biological properties relative to the parent antibody from 

- which they are generated. A convenient way for generating such substitutional variants involves affinity 

maturation using phage display. Briefly, several hypervariable region sites (e.g., 6-7 sites) are mutated to 
generate all possible amino substitutions at each site. The antibody variants thus generated are displayed in a 

25 monovalent fashion from filamentous phage particles as fusions to the gene III product of Ml 3 packaged within 
each particle. The phage-displayed variants are then screened for their biological activity (e.g., binding affinity) 
as herein disclosed. In order to identify candidate hypervariable region sites for modification, alanine scanning 
mutagenesis can be performed to identify hypervariable region residues contributing significantly to antigen 
binding . Alternatively, or additionally, it may be beneficial to analyze a crystal structure of the antigen-antibody 

30 complex to identify contact points between the antibody and human TAT polypeptide. Such contact residues and 
neighboring residues are candidates for substitution according to the techniques elaborated herein. Once such 
variants are generated, the panel of variants is subjected to screening as described herein and antibodies with 
superior properties in one or more relevant assays may be selected for further development. 

Nucleic acid molecules encoding amino acid sequence variants of the anti-TAT antibody are prepared 

35 by a variety of methods known in the art. These methods include, but are not limited to, isolation from a natural 
source (in the case of naturally occurring amino acid sequence variants) or preparation by oligonucleotide- 
mediated (or site-directed) mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier prepared 
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variant or a non-variant version of the anti-TAT antibody. 

H. Modifications of Anti-TAT Antibodies and TAT Polypeptides 

Covalent modifications of anti-TAT antibodies and TAT polypeptides are included within the scope of 
this invention. One type of covalent modification includes reacting targeted amino acid residues of an anti-TAT 
antibody or TAT polypeptide with an organic derivatizing agent that is capable of reacting with selected side 
5 chains or the N- or C- terminal residues of the anti-TAT antibody or TAT polypeptide. Derivatization with 
bifunctional agents is useful, for instance, for crosslinking anti-TAT antibody or TAT polypeptide to a water- 
insoluble support matrix or surface for use in the method for purifying anti-TAT antibodies, and vice-versa. 
Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, 

1 0 including disuccinimidyl esters such as 3 , 3 ' -dithiobis(succinimidylpropionate) , bifunctional maleimides such as 
bis-N-maleimido-l,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 

- groups of seryl or threonyl residues, methylation of the a-amino groups of lysine, arginine, and histidine side 

45 chains [T.E. Creighton, Proteins: Structure and Molecular Properties , W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the anti-TAT antibody or TAT polypeptide included within 
the scope of this invention comprises altering the native glycosylation pattern of the antibody or polypeptide. 
"Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more 

-20 carbohydrate moieties found in native sequence anti-TAT antibody or TAT polypeptide (either by removing the 
underlying glycosylation site or by deleting the glycosylation by chemical and/or enzymatic means), and/or 
adding one or more glycosylation sites that are not present in the native sequence anti-TAT antibody or TAT 
polypeptide. In addition, the phrase includes qualitative changes in the glycosylation of the native proteins, 
involving a change in the nature and proportions of the various carbohydrate moieties present. 

25 Glycosylation of antibodies and other polypeptides is typically either N-linked or O-linked. N-linked 

refers to the attachment of the carbohydrate moiety to the side chain of an asparagine residue. The tripeptide 
sequences asparagine-X-serine and asparagine-X-threonine, where X is any amino acid except proline, are the 
recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine side chain. Thus, 
the presence of either of these tripeptide sequences in a polypeptide creates a potential glycosylation site. O- 

30 linked glycosylation refers to the attachment of one of the sugars N-aceylgalactosamine, galactose, or xylose to 
a hydroxyamino acid, most commonly serine or threonine, although 5-hydroxyproline or 5-hydroxylysine may 
also be used. 

Addition of glycosylation sites to the anti-TAT antibody or TAT polypeptide is conveniently 
accomplished by altering the amino acid sequence such that it contains one or more of the above-described 
35 tripeptide sequences (for N-linked glycosylation sites). The alteration may also be made by the addition of, or 
substitution by, one or more serine or threonine residues to the sequence of the original anti-TAT antibody or 
TAT polypeptide (for O-linked glycosylation sites). The anti-TAT antibody or TAT polypeptide amino acid 



sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the anti-TAT antibody or TAT polypeptide at preselected bases such that codons are generated that will 
translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the anti-TAT antibody or TAT 
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described 
5 in the art, e.g., in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. 
Biochem. . pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the anti-TAT antibody or TAT polypeptide may be 
accomplished chemically or enzymatically or by mutational substitution of codons encoding for amino acid 
residues that serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and 
10 described, for instance, by Hakimuddin, et al., Arch. Biochem. Biophvs. . 259 :52 (1987) and by Edge et al., 
Anal. Biochem. . 118 :131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved 
by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al. , Meth. Enzvmol. . 138 :350 
(1987). 

Another type of covalent modification of anti-TAT antibody or TAT polypeptide comprises linking the 
15 antibody or polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), 
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 
4,301,144; 4,670,417; 4,791,192 or 4,179,337. The antibody or polypeptide also may be entrapped in 
microcapsules prepared, for example, by coacervation techniques or by interfacial polymerization (for example, 
hydroxymefhylcellulose or gelatin-microcapsules and poly-(methylmethacylate) microcapsules, respectively), in 
"20 colloidal drug delivery systems (for example, liposomes, albumin microspheres, microemulsions, nano-particles 
and nanocapsules), or in macroemulsions. Such techniques are disclosed in Remington's Pharmaceutical 
7 Sciences . 16th edition, Oslo, A., Ed., (1980). 

~. The anti-TAT antibody or TAT polypeptide of the present invention may also be modified in a way to 

form chimeric molecules comprising an anti-TAT antibody or TAT polypeptide fused to another, heterologous 

25 polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the anti-TAT antibody or TAT 
polypeptide with a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. 
The epitope tag is generally placed at the amino- or carboxyl- terminus of the anti-TAT antibody or TAT 
polypeptide. The presence of such epitope-tagged forms of the anti-TAT antibody or TAT polypeptide can be 

30 detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the anti-TAT 
antibody or TAT polypeptide to be readily purified by affinity purification using an anti-tag antibody or another 
type of affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective antibodies are 
well known in the art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
the flu HA tag polypeptide and its antibody 12CA5 [Field et al. , Mol. Cell. Biol. . 8:2159-2165 (1988)]; the c- 

35 myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular 
Biology . 5 :3610-3616 (1985)] ; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky 
et al. , Protein Engineering . 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., 
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BioTechnology . 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al., Science . 255:192-194 (1992)]; 
an a-tubulin epitope peptide [Skinner et al., J. Biol. Chem. , 266:15163-15166 (1991)]; and the T7 gene 10 
protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA . 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the anti-TAT antibody 
or TAT polypeptide with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form 
5 of the chimeric molecule (also referred to as an "immunoadhesin"), such a fusion could be to the Fc region of 
an IgG molecule. The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted 
or inactivated) form of an anti-TAT antibody or TAT polypeptide in place of at least one variable region within 
an Ig molecule . In a particularly preferred embodiment, the immunoglobulin fusion includes the hinge, CH 2 and 
CH 3 , or the hinge, CHj, CH 2 and CH 3 regions of an IgGl molecule. For the production of immunoglobulin 
10 fusions see also US Patent No. 5,428, 130 issued June 27, 1995. 

I. Preparation of Anti-TAT Antibodies and TAT Polypeptides 

The description below relates primarily to production of anti-TAT antibodies and TAT polypeptides by 
culruring cells transformed or transfected with a vector containing anti-TAT antibody- and TAT polypeptide- 

^ encoding nucleic acid. It is, of course, contemplated that alternative methods, which are well known in the art, 

15 may be employed to prepare anti-TAT antibodies and TAT polypeptides. For instance, the appropriate amino 
acid sequence, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques 
[see, e.g., Stewart et al., Solid-Phase Peptide Synthesis . W.H. Freeman Co., San Francisco, CA (1969); 
Merrifield, J. Am. Chem. Soc . 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using 
manual techniques or by automation. Automated synthesis may be accomplished, for instance, using an Applied 

-20 Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the 
anti-TAT antibody or TAT polypeptide may be chemically synthesized separately and combined using chemical 
or enzymatic methods to produce the desired anti-TAT antibody or TAT polypeptide. 

1. Isolation of DNA Encoding Anti-TAT Antibody or TAT Polypeptide 
DNA encoding anti-TAT antibody or TAT polypeptide may be obtained from a cDNA library prepared 

25 from tissue believed to possess the anti-TAT antibody or TAT polypeptide mRNA and to express it at a 
detectable level. Accordingly, human anti-TAT antibody or TAT polypeptide DNA can be conveniently obtained 
from a cDNA library prepared from human tissue. The anti-TAT antibody- or TAT polypeptide-encoding gene 
may also be obtained from a genomic library or by known synthetic procedures (e.g., automated nucleic acid 
synthesis). 

30 Libraries can be screened with probes (such as oligonucleotides of at least about 20-80 bases) designed 

to identify the gene of interest or the protein encoded by it. Screening the cDNA or genomic library with the 
selected probe may be conducted using standard procedures, such as described in Sambrook et al., Molecular 
Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). An alternative means 
to isolate the gene encoding anti-TAT antibody or TAT polypeptide is to use PCR methodology [Sambrook et 

35 al., supra : Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
1995)]. 
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Techniques for screening a cDNA library are well known in the art. The oligonucleotide sequences 
selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library 
being screened. Methods of labeling are well known in the art, and include the use of radiolabels like 32 P-labeled 
ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high 
5 stringency, are provided in Sambrook et al., supra . 

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the art and as described herein. 

10 Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 

libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al., supra , to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 
2. Selection and Transformation of Host Cells 

15 Host cells are transfected or transformed with expression or cloning vectors described herein for anti- 

TAT antibody or TAT polypeptide production and cultured in conventional nutrient media modified as 
appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired 
sequences. The culture conditions, such as media, temperature, pH and the like, can be selected by the skilled 
artisan without undue experimentation. In general, principles, protocols, and practical techniques for maximizing 

20 the productivity of cell cultures can be found in Mammalian Cell Biotechnology: a Practical Approach . M. 
Butler, ed. (IRL Press, 1991) and Sambrook et al., supra . 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily 
skilled artisan, for example, CaCl 2 , CaP0 4 , liposome-mediated and electroporation. Depending on the host cell 
used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment 

25 employing calcium chloride, as described in Sambrook et al., supra , or electroporation is generally used for 
prokaryotes. • Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et al. , Gene . 23 :3 15 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells 
without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology . 
52:456-457 (1978) can be employed. General aspects of mammalian cell host system transfections have been 

30 described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the 
method of Van Solingen et al. , J.Bact. . 130:946 (1977) and Hsiao et al. , Proc. Natl. Acad. Sci. (USA) . 76:3829 
(1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, 
electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g. , polybrene, poly ornithine, may 
also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods in 

35 Enzvmology . 185:527-537 (1990) and Mansour et al., Nature . 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, 
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative 



or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are 
publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli 
strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include 
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter , Erwinia, Klebsiella, Proteus, Salmonella, 
e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. 
5 subtilis and B. licheniformis (e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 April 1989), 
Pseudomonas such as P. aeruginosa, and Streptomyces . These examples are illustrative rather than limiting. 
Strain W3 1 10 is one particularly preferred host or parent host because it is a common host strain for recombinant 
DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For 
example, strain W3 110 may be modified to effect a genetic mutation in the genes encoding proteins endogenous 
10 to the host, with examples of such hosts including E. coli W3 1 10 strain 1 A2, which has the complete genotype 
tonA ; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli W3110 strain 27C7 
(ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompTkan'; E. coli 
W3110 strain 37D6, which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompT rbs7 
-=- ilvG kan r ; E. coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion 
15 mutation; andanis. co/r strain having mutant periplasmic protease disclosed in U.S. PatentNo. 4,946,783 issued 
7 August 1990. Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid polymerase 
reactions, are suitable. 

Full length antibody, antibody fragments, and antibody fusion proteins can be produced in bacteria, in 
particular when glycosylation and Fc effector function are not needed, such as when the therapeutic antibody 

20 is conjugated to a cytotoxic agent (e.g. , a toxin) and the immunoconjugate by itself shows effectiveness in tumor 
cell destruction. Full length antibodies have greater half life in circulation. Production in E. coli is faster and 
more cost efficient. For expression of antibody fragments and polypeptides in bacteria, see, e.g., U.S. 
* 5,648,237 (Carter et. al.), U.S. 5,789, 199 (Joly et al.), and U.S. 5,840,523 (Simmons et al.) which describes 
translation initiation regio (TIR) and signal sequences for optimizing expression and secretion, these patents 

25 incorporated herein by reference. After expression, the antibody is isolated from the E. coli cell paste in a 
soluble fraction and can be purified through, e.g., a protein A or G column depending on the isotype. Final 
purification can be carried out similar to the process for purifying antibody expressed e.g,, in CHO cells. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning 
or expression hosts for anti-TAT antibody- or TAT polypeptide-encoding vectors. Saccharomyces cerevisiae 

30 is a commonly used lower eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Beach 
and Nurse, Nature . 290: 140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. PatentNo. 
4,943,529; Fleer et al., Bio/Technology . 9:968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, 
CBS4574; Louvencourt et al. , J. Bacteriol. . 154(2): 73 7-742 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus 
(ATCC 16,045), K. wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; 

35 Van den Berg et al., Bio/Technology . 8:135 (1990)), K. thermotolerans, and K. marxianus; yarrowia (EP 
402,226); Pichia pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol. . 28:265-278 [1988]); Candida; 
Trichoderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA . 76:5259-5263 
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[1979]); Schwanniomyces such as Schwanniomyces occidentalis (EP 394,538 published 31 October 1990); and 
filamentous fungi such as, e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 January 
1991), and Aspergillus hosts such as A. nidulans (Ballance et al. , Biochem. Biophvs. Res. Commun. . 1 12:284- 
289 [1983]; Tilburnetal., Gene, 26:205-221 [1983]; Yeltonet al.. Proc. Natl. Acad. Sci. USA . 81: 1470-1474 
[1984]) m&A. niger (Kelly and Hynes, EMBOJ. . 4:475-479 [1985]). Methylotropic yeasts are suitable herein 
and include, but are not limited to, yeast capable of growth on methanol selected from the genera consisting of 
Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis, andRhodotorula. A list of specific species 
that are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrophs . 269 
(1982). 

Suitable host cells for the expression of glycosylated anti-TAT antibody or TAT polypeptide are derived 
from multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and 
Spodoptera Sf9, as well as plant cells, such as cell cultures of cotton, corn, potato, soybean, petunia, tomato, 
and tobacco. Numerous baculoviral strains and variants and corresponding permissive insect host cells from 
hosts such as Spodoptera frugiperda (caterpillar), Aedes aegypti (mosquito), Aedes albopictus (mosquito), 
Drosophila melanogaster (fruitfly), and Bombyx mori have been identified. A variety of viral strains for 
transfection are publicly available, e.g. , the L-l variant of Autographa calif ornica NPV and the Bm-5 strain of 
Bombyx mori NPV, and such viruses may be used as the virus herein according to the present invention, 
particularly for transfection of Spodoptera frugiperda cells. 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture 
(tissue culture) has become a routine procedure. Examples of useful mammalian host cell lines are monkey 
kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 
cells subcloned for growth in suspension culture, Graham et al., J. Gen Virol. 36:59 (1977)); baby hamster 
kidney cells (BHK, ATCC CCL 10); Chinese hamster ovary cells/-DHFR (CHO, Urlaub et al., Proc. Natl. 
Acad. Sci. USA 77:4216 (1980)); mouse Sertoli cells (TM4, Mather. Biol. Reprod. 23:243-251 (1980)); monkey 
kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76, ATCC CRL-1587); human 
cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver 
cells (BRL 3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 
8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et al., Annals N.Y. Acad. 
ScL 383:44-68 (1982)); MRC 5 cells; FS4 cells; and a human hepatoma line (Hep G2). 

Host cells are transformed with the above-described expression or cloning vectors for anti-TAT antibody 
or TAT polypeptide production and cultured in conventional nutrient media modified as appropriate for inducing 
promoters, selecting transformants, or amplifying the genes encoding the desired sequences. 
3. Selection and Use of a Replicable Vector 
The nucleic acid (e.g. , cDNA or genomic DNA) encoding anti-TAT antibody or TAT polypeptide may 
be inserted into a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors 
are publicly available. The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or 
phage. The appropriate nucleic acid sequence may be inserted into the vector by a variety of procedures. In 
general, DNA is inserted into an appropriate restriction endonuclease site(s) using techniques known in the art. 
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Vector components generally include, but are not limited to, one or more of a signal sequence, an origin of 
replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination 
sequence. Construction of suitable vectors containing one or more of these components employs standard 
ligation techniques which are known to the skilled artisan. 

The TAT may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of 
the vector, or it may be a part of the anti-TAT antibody- or TAT polypeptide-encoding DNA that is inserted into 
the vector. The signal sequence may be a prokaryotic signal sequence selected, for example, from the group 
of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion the signal 
sequence may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces and 
Kluyveromyces a-factor leaders, the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, 
the C. albicans glucoamylase leader (EP 362,179 published 4 April 1990), or the signal described in WO 
90/13646 published 15 November 1990. In mammalian cell expression, mammalian signal sequences may be 
used to direct secretion of the protein, such as signal sequences from secreted polypeptides of the same or related 
species, as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 
in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2/x plasmid 
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 
cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients 
not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable the identification 
of cells competent to take up the anti-TAT antibody- or TAT polypeptide-encoding nucleic acid, such as DHFR 
or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient 
in DHFR activity, prepared and propagated as described by Urlaub et al. , Proc. Natl. Acad. Sci. USA . 77:4216 
(1980). A suitable selection gene for use in yeast is the trp\ gene present in the yeast plasmid YRp7 
[Stinchcomb et al., Nature . 282:39 (1979); Kingsman et al., Gene . 7:141 (1979); Tschemper et al., Gene . 
10:157 (1980)]. The trp\ gene provides a selection marker for a mutant strain of yeast lacking the ability to 
grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics . 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the anti-TAT antibody- 
or TAT polypeptide-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a 
variety of potential host cells are well known. Promoters suitable for use with prokaryotic hosts include the P- 
lactamase and lactose promoter systems fChang et al. . Nature . 275:615(1978); Goeddel et al.. Nature . 281:544 
( 1 979)] , alkaline phosphatase, a tryptophan (tip) promoter system [Goeddel, Nucleic Acids Res. . 8 :4057 (1980); 
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EP 36,776], and hybrid promoters such as the tac promoter [deBoer et al. , Proc. Natl. Acad. Sci. USA . 80:21- 
25 (1983)] . Promoters for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably 
linked to the DNA encoding anti-TAT antibody or TAT polypeptide. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem. . 255:2073 (1980)] or other glycolytic enzymes [Hess 
et al-, J- Adv. Enzyme Reg. . 7:149 (1968); Holland, Biochemistry . 17:4900 (1978)], such as enolase, 
glyceraldehyde-3 -phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose- 
6-phosphate isomerase, 3-phosphoglyceratemutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast expression are further described in EP 73,657. 

Anti-TAT antibody or TAT polypeptide transcription from vectors in mammalian host cells is 
controlled, for example, by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox 
virus (UK 2,21 1,504 published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian 
sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous 
mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock 
promoters, provided such promoters are compatible with the host cell systems. 

Transcription of a DNA encoding the anti-TAT antibody or TAT polypeptide by higher eukaryotes may 
be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, 
usually about from 10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences 
are now known from mammalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, 
however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the 
late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into 
the vector at a position 5 ' or 3 ' to the anti-TAT antibody or TAT polypeptide coding sequence, but is preferably 
located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 
cells from other multicellular organisms) will also contain sequences necessary for the termination of 
transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, 
occasionally 3' , untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding anti-TAT 
antibody or TAT polypeptide. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of anti-TAT antibody 
or TAT polypeptide in recombinant vertebrate cell culture are described in Gething et al., Nature . 293:620-625 
(1981); Mantei et al., Nature . 281:40-46 (1979); EP 117,060; and EP 117,058. 
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4. Culturing the Host Cells 

The host cells used to produce the anti-TAT antibody or TAT polypeptide of this invention may be 
cultured in a variety of media. Commercially available media such as Ham's F10 (Sigma), Minimal Essential 
Medium ((MEM), (Sigma), RPMI-1640 (Sigma), and Dulbecco's Modified Eagle's Medium ((DMEM), Sigma) 
are suitable for culturing the host cells. In addition, any of the media described in Ham et al. , Meth. Enz. 58:44 
(1979), Barnes et al., Anal. Biochem. 102:255 (1980), U.S. Pat. Nos. 4,767,704; 4,657,866; 4,927,762; 
4,560,655; or 5,122,469; WO 90/03430; WO 87/00195; or U.S. Patent Re. 30,985 may be used as culture 
media for the host cells. Any of these media may be supplemented as necessary with hormones and/or other 
growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such as sodium chloride, calcium, 
magnesium, and phosphate), buffers (such as HEPES), nucleotides (such as adenosine and thymidine), antibiotics 
(such as GENTAMYCIN™ drug), trace elements (defined as inorganic compounds usually present at final 
concentrations in the micromolar range), and glucose or an equivalent energy source. Any other necessary 
supplements may also be included at appropriate concentrations that would be known to those skilled in the art. 
The culture conditions, such as temperature, pH, and the like, are those previously used with the host cell 
selected for expression, and will be apparent to the ordinarily skilled artisan. 

5. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl. 
Acad. Sci. USA, 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed 
that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes 
or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out where the 
duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound 
to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the 
antibodies may be prepared against a native sequence TAT polypeptide or against a synthetic peptide based on 
the DNA sequences provided herein or against exogenous sequence fused to TAT DNA and encoding a specific 
antibody epitope. 

6. Purification of Anti-TAT Antibody and TAT Polypeptide 

Forms of anti-TAT antibody and TAT polypeptide may be recovered from culture medium or from host 
cell lysates. If membrane -bound, it can be released from the membrane using a suitable detergent solution (e.g. 
Triton-X 100) or by enzymatic cleavage. Cells employed in expression of anti-TAT antibody and TAT 
polypeptide can be disrupted by various physical or chemical means, such as freeze-thaw cycling, sonication, 
mechanical disruption, or cell lysing agents. 
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It may be desired to purify anti-TAT antibody and TAT polypeptide from recombinant cell proteins or 
polypeptides. The following procedures are exemplary of suitable purification procedures: by fractionation on 
an ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation- 
exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration 
using, for example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and 
5 metal chelating columns to bind epitope-tagged forms of the anti-TAT antibody and TAT polypeptide. Various 
methods of protein purification may be employed and such methods are known in the art and described for 
example in Deutscher, Methods in Enzvmologv . 182 (1990); Scopes, Protein Purification: Principles and 
Practice, Springer-Verlag, New York (1982). The purification step(s) selected will depend, for example, on the 
nature of the production process used and the particular anti-TAT antibody or TAT polypeptide produced. 
10 When using recombinant techniques, the antibody can be produced intracellularly, in the periplasmic 

space, or directly secreted into the medium. If the antibody is produced intracellularly, as a first step, the 
particulate debris, either host cells or lysed fragments, are removed, for example, by centrifugation or 
ultrafiltration. Carter et al., Bio/Technology 10: 163-167 (1992) describe a procedure for isolating antibodies 
-- which are secreted to the periplasmic space of E. coli. Briefly, cell paste is thawed in the presence of sodium 
15 acetate (pH 3.5), EDTA, and phenylmethylsulfonylfluoride (PMSF) over about 30 min. Cell debris can be 
= removed by centrifugation. Where the antibody is secreted into the medium, superaatants from such expression 
systems are generally first concentrated using a commercially available protein concentration filter, for example , 
an Amicon or Millipore Pellicon ultrafiltration unit. A protease inhibitor such as PMSF may be included in any 
of the foregoing steps to inhibit proteolysis and antibiotics may be included to prevent the growth of adventitious 
20 contaminants. 

a The antibody composition prepared from the cells can be purified using, for example, hydroxylapatite 

chromatography, gel electrophoresis, dialysis, and affinity chromatography, with affinity chromatography being 
I the preferred purification technique. The suitability of protein A as an affinity ligand depends on the species and 
isotype of any immunoglobulin Fc domain that is present in the antibody. Protein A can be used to purify 

25 antibodies that are based on human yl, y2 or y4 heavy chains (Lindmark et al. , J. Immunol. Meth. 62: 1-13 
(1983)). Protein G is recommended for all mouse isotypes and for human y3 (Guss et al. , EMBO J. 5 : 15671575 
(1986)). The matrix to which the affinity ligand is attached is most often agarose, but other matrices are 
available. Mechanically stable matrices such as controlled pore glass or poly(styrenedivinyl)benzene allow for 
faster flow rates and shorter processing times than can be achieved with agarose. Where the antibody comprises 

30 a C H 3 domain, the Bakerbond ABX™resin (J. T. Baker, Phillipsburg, NJ) is useful for purification. Other 
techniques for protein purification such as fractionation on an ion-exchange column, ethanol precipitation, 
Reverse Phase HPLC, chromatography on silica, chromatography on heparin SEPHAROSE™ chromatography 
on an anion or cation exchange resin (such as a polyaspartic acid column), chromatofocusing, SDS-PAGE, and 
ammonium sulfate precipitation are also available depending on the antibody to be recovered. 

35 Following any preliminary purification step(s), the mixture comprising the antibody of interest and 

contaminants may be subjected to low pH hydrophobic interaction chromatography using an elution buffer at a 
pH between about 2.5-4.5, preferably performed at low salt concentrations (e.g., from about 0-0.25M salt). 
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J. Pharmaceutical Formulations 

Therapeutic formulations of the anti-TAT antibodies, TAT binding oligopeptides, TAT binding organic 
molecules and/or TAT polypeptides used in accordance with the present invention are prepared for storage by 
mixing the antibody, polypeptide, oligopeptide or organic molecule having the desired degree of purity with 
optional pharmaceutically acceptable carriers, excipients or stabilizers (Remington's Pharmaceutical Sciences 
16th edition, Osol, A. Ed. (1980)), in the form of lyophilized formulations or aqueous solutions. Acceptable 
carriers, excipients, or stabilizers are nontoxic to recipients at the dosages and concentrations employed, and 
include buffers such as acetate, Tris, phosphate, citrate, and other organic acids; antioxidants including ascorbic 
acid and methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride; hexamethonium 
chloride; benzalkonium chloride, benzethonium chloride; phenol, butyl or benzyl alcohol; alkyl parabens such 
as methyl or propyl paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m-cresol); low molecular 
weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; 
hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, 
histidine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, 
mannose, or dextrins; chelating agents such as EDTA; tonicifiers such as trehalose and sodium chloride; sugars 
such as sucrose, mannitol, trehalose or sorbitol; surfactant such as polysorbate; salt-forming counter-ions such 
as sodium; metal complexes (e.g., Zn-protein complexes); and/or non-ionic surfactants such as TWEEN®, 
PLURONICS® or polyethylene glycol (PEG) . The antibody preferably comprises the antibody at a concentration 
of between 5-200 mg/ml, preferably between 10-100 mg/ml. 

The formulations herein may also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely affect each other. 
For example, in addition to an anti-TAT antibody, TAT binding oligopeptide, or TAT binding organic molecule, 
it may be desirable to include in the one formulation, an additional antibody, e.g. , a second anti-TAT antibody 
which binds a different epitope on the TAT polypeptide, or an antibody to some other target such as a growth 
factor that affects the growth of the particular cancer. Alternatively, or additionally, the composition may further 
comprise a chemotherapeutic agent, cytotoxic agent, cytokine, growth inhibitory agent, anti-hormonal agent, 
and/or cardioprotectant. Such molecules are suitably present in combination in amounts that are effective for 
the purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and 
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-particles and nanocapsules) or in macroemulsions. 
Such techniques are disclosed in Remington's Pharmaceutical Sciences . 16th edition, Osol, A. Ed. (1980). 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 
include semi-permeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in 
the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include 
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), orpoly(vinylalcohol)), polylactides (U.S. 
Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene-vinyl 
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acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT® (injectable microspheres 
composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

K- Diagnosis and Treatment with Anti-TAT Antibodies. TAT Binding Oligopeptides and TAT 
Binding Organic Molecules 

To determine TAT expression in the cancer, various diagnostic assays are available. In one 
embodiment, TAT polypeptide overexpression may be analyzed by immunohistochemistry (IHC). Parrafin 
embedded tissue sections from a tumor biopsy may be subjected to the IHC assay and accorded a TAT protein 
staining intensity criteria as follows: 

Score 0 - no staining is observed or membrane staining is observed in less than 10% of tumor cells. 

Score 1 + - a faint/barely perceptible membrane staining is detected in more than 10% of the tumor 
cells. The cells are only stained in part of their membrane. 

Score 2+ - a weak to moderate complete membrane staining is observed in more than 10% of the tumor 

cells. 

Score 3+ - a moderate to strong complete membrane staining is observed in more than 10% of the 
tumor cells. 

Those tumors with 0 or 1 + scores for TAT polypeptide expression may be characterized as not 
overexpressing TAT, whereas those tumors with 2 + or 3 + scores may be characterized as overexpressing TAT. 

Alternatively, or additionally, FISH assays such as the INFORM® (sold by Ventana, Arizona) or 
PATHVISION® (Vysis, Illinois) may be carried out on formalin-fixed, paraffin-embedded tumor tissue to 
determine the extent (if any) of TAT overexpression in the tumor. 

TAT overexpression or amplification may be evaluated using an in vivo diagnostic assay, e.g., by 
administering a molecule (such as an antibody, oligopeptide or organic molecule) which binds the molecule to 
be detected and is tagged with a detectable label (e.g. , a radioactive isotope or a fluorescent label) and externally 
scanning the patient for localization of the label. 

As described above, the anti-TAT antibodies, oligopeptides and organic molecules of the inventionhave 
various non-therapeutic applications. The anti-TAT antibodies, oligopeptides and organic molecules of the 
present invention can be useful for diagnosis and staging of TAT polypeptide-expressing cancers (e.g., in 
radioimaging). The antibodies, oligopeptides and organic molecules are also useful for purification or 
immunoprecipitation of TAT polypeptide from cells, for detection and quantitation of TAT polypeptide in vitro, 
e.g., in an ELISA or a Westernblot, to kill and eliminate TAT-expressing cells from a population of mixed cells 
as a step in the purification of other cells. 

Currently, depending on the stage of the cancer, cancer treatment involves one or a combination of the 
following therapies: surgery to remove the cancerous tissue, radiation therapy, and chemotherapy. Anti-TAT 
antibody, oligopeptide or organic molecule therapy may be especially desirable in elderly patients who do not 
tolerate the toxicity and side effects of chemotherapy well and in metastatic disease where radiation therapy has 
limited usefulness. The tumor targeting anti-TAT antibodies, oligopeptides and organic molecules of the 



invention are useful to alleviate TAT-expressing cancers upon initial diagnosis of the disease or during relapse. 
For therapeutic applications, the anti-TAT antibody, oligopeptide or organic molecule can be used alone, or in 
combination therapy with, e.g., hormones, antiangiogens, or radiolabeled compounds, or with surgery, 
cryotherapy, and/or radiotherapy. Anti-TAT antibody, oligopeptide or organic molecule treatment can be 
administered in conjunction with other forms of conventional therapy, either consecutively with, pre- or post- 
5 conventional therapy. Chemotherapeutic drugs such as TAXOTERE® (docetaxel), TAXOL® (palictaxel), 
estramustine and mitoxantrone are used in treating cancer, in particular, in good risk patients. In the present 
method of the invention for treating or alleviating cancer, the cancer patient can be administered anti-TAT 
antibody, oligopeptide or organic molecule in conjuction with treatment with the one or more of the preceding 
chemotherapeutic agents. In particular, combination therapy with palictaxel and modified derivatives (see, e.g. , 
10 EP0600517) is contemplated. The anti-TAT antibody, oligopeptide or organic molecule will be administered 
with a therapeutically effective dose of the chemotherapeutic agent. In another embodiment, the anti-TAT 
antibody, oligopeptide or organic molecule is administered in conjunction with chemotherapy to enhance the 
activity and efficacy of the chemotherapeutic agent, e.g., paclitaxel. The Physicians' Desk Reference (PDR) 
discloses dosages of these agents that have been used in treatment of various cancers. The dosing regimen and 

15 dosages of these aforementioned chemotherapeutic drugs that are therapeutically effective will depend on the 
particular cancer being treated, the extent of the disease and other factors familiar to the physician of skill in the 
art and can be determined by the physician. 

In one particular embodiment, a conjugate comprising an anti-TAT antibody, oligopeptide or organic 
molecule conjugated with a cytotoxic agent is administered to the patient. Preferably, the immunoconjugate 

20 bound to the TAT protein is internalized by the cell, resulting in increased therapeutic efficacy of the 
immunoconjugate in killing the cancer cell to which it binds. In a preferred embodiment, the cytotoxic agent 
targets or interferes with the nucleic acid in the cancer cell. Examples of such cytotoxic agents are described 
above and include maytansinoids, calicheamicins, ribonucleases and DNA endonucleases. 

The anti-TAT antibodies, oligopeptides, organic molecules or toxin conjugates thereof are administered 

25 to a human patient, in accord with known methods, such as intravenous administration, e.g.,, as a bolus or by 
continuous infusion over a period of time, by intramuscular, intraperitoneal, intracerobrospinal, subcutaneous, 
intra-articular, intrasynovial, intrathecal, oral, topical, or inhalation routes. Intravenous or subcutaneous 
administration of the antibody, oligopeptide or organic molecule is preferred. 

Other therapeutic regimens may be combined with the administration of the anti-TAT antibody, 

30 oligopeptide or organic molecule. The combined administration includes co-administration, using separate 
formulations or a single pharmaceutical formulation, and consecutive administration in either order, wherein 
preferably there is a time period while both (or all) active agents simultaneously exert their biological activities. 
Preferably such combined therapy results in a synergistic therapeutic effect. 

It may also be desirable to combine administration of the anti-TAT antibody or antibodies, oligopeptides 

35 or organic molecules, with administration of an antibody directed against another rumor antigen associated with 
the particular cancer. 
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In another embodiment, the therapeutic treatment methods of the present invention involves the 
combined administration of an anti-TAT antibody (or antibodies), oligopeptides or organic molecules and one 
or more chemotherapeutic agents or growth inhibitory agents, including co-administration of cocktails of 
different chemotherapeutic agents. Chemotherapeutic agents include estramustine phosphate, prednimustine, 
cisplatin, 5-fluorouracil, melphalan, cyclophosphamide, hydroxyurea andhydroxyureataxanes (such as paclitaxel 
5 and doxetaxel) and/or anthracycline antibiotics. Preparation and dosing schedules for such chemotherapeutic 
agents may be used according to manufacturers' instructions or as determined empirically by the skilled 
practitioner. Preparation and dosing schedules for such chemotherapy are also described in Chemotherapy 
Service Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). 

The antibody, oligopeptide or organic molecule may be combined with an anti-hormonal compound; 

10 e.g. , an anti-estrogen compound such as tamoxifen; an anti-progesterone such as onapristone (see, EP 616 812); 
or an anti-androgen such as flutamide, in dosages known for such molecules. Where the cancer to be treated 
is androgen independent cancer, the patient may previously have been subjected to anti-androgen therapy and, 
after the cancer becomes androgen independent, the anti-TAT antibody, oligopeptide or organic molecule (and 

^ optionally other agents as described herein) may be administered to the patient. 

t5 Sometimes, it may be beneficial to also co-administer a cardioprotectant (to prevent or reduce 

-: myocardial dysfunction associated with the therapy) or one or more cytokines to the patient. In addition to the 
above therapeutic regimes, the patient may be subjected to surgical removal of cancer cells and/or radiation 
therapy, before, simultaneously with, or post antibody, oligopeptide or organic molecule therapy. Suitable 
dosages for any of the above co-administered agents are those presently used and may be lowered due to the 

20 combined action (synergy) of the agent and anti-TAT antibody, oligopeptide or organic molecule. 

For the prevention or treatment of disease, the dosage and mode of administration will be chosen by 
the physician according to known criteria. The appropriate dosage of antibody, oligopeptide or organic molecule 
will depend on the type of disease to be treated, as defined above, the severity and course of the disease, whether 
the antibody, oligopeptide or organic molecule is administered for preventive or therapeutic purposes, previous 

25 therapy, the patient's clinical history and response to the antibody, oligopeptide or organic molecule, and the 
discretion of the attending physician. The antibody, oligopeptide or organic molecule is suitably administered 
to the patient at one time or over a series of treatments. Preferably, the antibody, oligopeptide or organic 
molecule is administered by intravenous infusion or by subcutaneous injections. Depending on the type and 
severity of the disease, about 1 jug/kg to about 50 mg/kg body weight (e.g. , about 0. l-15mg/kg/dose) of antibody 

30 can be an initial candidate dosage for administration to the patient, whether, for example, by one or more 
separate administrations, or by continuous infusion. A dosing regimen can comprise administering an initial 
loading dose of about 4 mg/kg, followed by a weekly maintenance dose of about 2 mg/kg of the anti-TAT 
antibody. However, other dosage regimens may be useful. A typical daily dosage might range from about 1 
/ig/kg to 100 mg/kg or more, depending on the factors mentioned above. For repeated administrations over 

35 several days or longer, depending on the condition, the treatment is sustained until a desired suppression of 
disease symptoms occurs. The progress of this therapy can be readily monitored by conventional methods and 
assays and based on criteria known to the physician or other persons of skill in the art. 



Aside from administration of the antibody protein to the patient, the present application contemplates 
administration of the antibody by gene therapy. Such administration of nucleic acid encoding the antibody is 
encompassed by the expression "administering a therapeutically effective amount of an antibody". See, for 
example, WO96/07321 published March 14, 1996 concerning the use of gene therapy to generate intracellular 
antibodies. 

5 There are two major approaches to getting the nucleic acid (optionally contained in a vector) into the 

patient's cells; in vivo and ex vivo. For in vivo delivery the nucleic acid is injected directly into the patient, 
usually at the site where the antibody is required. For ex vivo treatment, the patient's cells are removed, the 
nucleic acid is introduced into these isolated cells and the modified cells are administered to the patient either 
directly or, for example, encapsulated within porous membranes which are implanted into the patient (see, e.g., 

10 U.S. PatentNos. 4,892,538 and 5,283,187). There are a variety of techniques available for introducing nucleic 
acids into viable cells. The techniques vary depending upon whether the nucleic acid is transferred into cultured 
cells in vitro, or in vivo in the cells of the intended host. Techniques suitable for the transfer of nucleic acid into 
mammalian cells in vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE- 
dextran, the calcium phosphate precipitation method, etc. A commonly used vector for ex vivo delivery of the 

15 gene is a retroviral vector. 

- : - The currently preferred in vivo nucleic acid transfer techniques include transfection with viral vectors 

(such as adenovirus, Herpes simplex I virus, or adeno-associated virus) and lipid-based systems (useful lipids 
for lipid-mediated transfer of the gene are DOTMA, DOPE and DC-Choi, for example). For review of the 
= currently known gene marking and gene therapy protocols see Anderson et al., Science 256:808-813 (1992). 

20 See also WO 93/25673 and the references cited therein. 

The anti-TAT antibodies of the invention can be in the different forms encompassed by the definition 
= of "antibody" herein. Thus, the antibodies include full length or intact antibody, antibody fragments, native 
~ sequence antibody or amino acid variants, humanized, chimeric or fusion antibodies, immunoconjugates, and 
functional fragments thereof. In fusion antibodies an antibody sequence is fused to a heterologous polypeptide 
25 sequence. The antibodies can be modified in the Fc region to provide desired effector functions. As discussed 
in more detail in the sections herein, with the appropriate Fc regions, the naked antibody bound on the cell 
surface can induce cytotoxicity, e.g., via antibody-dependent cellular cytotoxicity (ADCC) or by recruiting 
complement in complement dependent cytotoxicity, or some other mechanism. Alternatively, where it is 
desirable to eliminate or reduce effector function, so as to minimize side effects or therapeutic complications, 
30 certain other Fc regions may be used. 

In one embodiment, the antibody competes for binding or bind substantially to, the same epitope as the 
antibodies of the invention. Antibodies having the biological characteristics of the present anti-TAT antibodies 
of the invention are also contemplated, specifically including the in vivo tumor targeting and any cell proliferation 
inhibition or cytotoxic characteristics. 
35 Methods of producing the above antibodies are described in detail herein. 

The present anti-TAT antibodies, oligopeptides and organic molecules are useful for treating a TAT- 
expressing cancer or alleviating one or more symptoms of the cancer in a mammal. Such a cancer includes 



prostate cancer, cancer of the urinary tract, lung cancer, breast cancer, colon cancer and ovarian cancer, more 
specifically, prostate adenocarcinoma, renal cell carcinomas, colorectal adenocarcinomas, lung adenocarcinomas, 
lung squamous cell carcinomas, and pleural mesothelioma. The cancers encompass metastatic cancers of any of 
the preceding. The antibody, oligopeptide or organic molecule is able to bind to at least a portion of the cancer 
cells that express TAT polypeptide in the mammal. In a preferred embodiment, the antibody, oligopeptide or 
5 organic molecule is effective to destroy or kill TAT-expressing tumor cells or inhibit the growth of such tumor 
cells, in vitro or in vivo, upon binding to TAT polypeptide on the cell. Such an antibody includes a naked anti- 
TAT antibody (not conjugated to any agent). Naked antibodies that have cytotoxic or cell growth inhibition 
properties can be further harnessed with a cytotoxic agent to render them even more potent in tumor cell 
destruction. Cytotoxic properties can be conferred to an anti-TAT antibody by, e.g., conjugating the antibody 

10 with a cytotoxic agent, to form an immunoconjugate as described herein. The cytotoxic agent or a growth 
inhibitory agent is preferably a small molecule. Toxins such as calicheamicin or a maytansinoid and analogs or 
derivatives thereof, are preferable. 

The invention provides a composition comprising an anti-TAT antibody, oligopeptide or organic 

j fi molecule of the invention, and a carrier. For the purposes of treating cancer, compositions can be administered 

|D5 to the patient in need of such treatment, wherein the composition can comprise one or more anti-TAT antibodies 
present as an immunoconjugate or as the naked antibody. In a further embodiment, the compositions can 
~| comprise these antibodies, oligopeptides or organic molecules in combination with other therapeutic agents such 
as cytotoxic or growth inhibitory agents, including chemotherapeutic agents. The invention also provides 
formulations comprising an anti-TAT antibody , oligopeptide or organic molecule of the invention, and a carrier. 

30 In one embodiment, the formulation is a therapeutic formulation comprising a pharmaceutical^ acceptable 

I i carrier. 

;F Another aspect of the invention is isolated nucleic acids encoding the anti-TAT antibodies. Nucleic 

j~ acids encoding both the H and L chains and especially the hypervariable region residues, chains which encode 
the native sequence antibody as well as variants, modifications and humanized versions of the antibody, are 
25 encompassed. 

The invention also provides methods useful for treating a TAT polypeptide-expressing cancer or 
alleviating one or more symptoms of the cancer in a mammal, comprising administering a therapeutically 
effective amount of an anti-TAT antibody, oligopeptide or organic molecule to the mammal. The antibody, 
oligopeptide or organic molecule therapeutic compositions can be administered short term (acute) or chronic, 

30 or intermittent as directed by physician. Also provided are methods of inhibiting the growth of, and killing a 
TAT polypeptide-expressing cell. 

The invention also provides kits and articles of manufacture comprising at least one anti-TAT antibody, 
oligopeptide or organic molecule. Kits containing anti-TAT antibodies, oligopeptides or organic molecules find 
use, e.g., for TAT cell killing assays, for purification or immunoprecipitation of TAT polypeptide from cells. 

35 For example, for isolation and purification of TAT, the kit can contain an anti-TAT antibody, oligopeptide or 
organic molecule coupled to beads (e.g., sepharose beads). Kits can be provided which contain the antibodies, 
oligopeptides or organic molecules for detection and quantitation of TAT in vitro, e.g., in an ELISA or a 



Western blot. Such antibody, oligopeptide or organic molecule useful for detection may be provided with a label 
such as a fluorescent or radiolabel. 

L. Articles of Manufacture and Kits 

Another embodiment of the invention is an article of manufacture containing materials useful for the 
treatment of anti-TAT expressing cancer. The article of manufacture comprises a container and a label or 
5 package insert on or associated with the container. Suitable containers include, for example, bottles, vials, 
syringes, etc. The containers may be formed from a variety of materials such as glass or plastic. The container 
holds a composition which is effective for treating the cancer condition and may have a sterile access port (for 
example the container may be an intravenous solution bag or a vial having a stopper pierceable by a hypodermic 
injection needle). At least one active agent in the composition is an anti-TAT antibody, oligopeptide or organic 
10 molecule of the invention. The label or package insert indicates that the composition is used for treating cancer. 
The label or package insert will further comprise instructions for adrriinistering the antibody, oligopeptide or 
organic molecule composition to the cancer patient. Additionally, the article of manufacture may further 
comprise a second container comprising a pharmaceutically-acceptable buffer, such as bacteriostatic water for 
= injection (BWFI), phosphate-buffered saline, Ringer's solution and dextrose solution. It may further include 
15 other materials desirable from a commercial and user standpoint, including other buffers, diluents, filters, 
~ needles, and syringes. 

Kits are also provided that are useful for various purposes , e . g . , for TAT-expressing cell killing assays , 
for purification or immunoprecipitation of TAT polypeptide from cells. For isolation and purification of TAT 
polypeptide, the kit can contain an anti-TAT antibody, oligopeptide or organic molecule coupled to beads (e.g., 
20 sepharose beads). Kits can be provided which contain the antibodies, oligopeptides or organic molecules for 
detection and quantitation of TAT polypeptide in vitro, e.g. , in an ELISA or a Western blot. As with the article 
= of manufacture, the kit comprises a container and a label or package insert on or associated with the container. 
The container holds a composition comprising at least one anti-TAT antibody, oligopeptide or organic molecule 
of the invention. Additional containers may be included that contain, e.g., diluents and buffers, control 
25 antibodies. The label or package insert may provide a description of the composition as well as instructions for 
the intended in vitro or diagnostic use. 

M. Uses for TAT Polypeptides and TAT-Polypeptide Encoding Nucleic Acids 
Nucleotide sequences (or their complement) encoding TAT polypeptides have various applications in 
the art of molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in 
30 the generation of anti-sense RNA and DNA probes. TAT-encoding nucleic acid will also be useful for the 
preparation of TAT polypeptides by the recombinant techniques described herein, wherein those TAT 
polypeptides may find use, for example, in the preparation of anti-TAT antibodies as described herein. 

The full-length native sequence TAT gene, or portions thereof, may be used as hybridization probes 
for a cDNA library to isolate the full-length TAT cDNA or to isolate still other cDNAs (for instance, those 
35 encoding naturally-occurring variants of TAT or TAT from other species) which have a desired sequence identity 
to the native TAT sequence disclosed herein. Optionally, the length of the probes will be about 20 to about 50 
bases. The hybridization probes may be derived from at least partially novel regions of the full length native 



nucleotide sequence wherein those regions may be determined without undue experimentation or from genomic 
sequences including promoters, enhancer elements and introns of native sequence TAT. By way of example, 
a screening method will comprise isolating the coding region of the TAT gene using the known DNA sequence 
to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of labels, 
including radionucleotides such as 32 P or 35 S, or enzymatic labels such as alkaline phosphatase coupled to the 
5 probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the TAT 
gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to 
determine which members of such libraries the probe hybridizes to. Hybridization techniques are described in 
further detail in the Examples below. Any EST sequences disclosed in the present application may similarly be 
employed as probes, using the methods disclosed herein. 

10 Other useful fragments of the TAT-encoding nucleic acids include antisense or sense oligonucleotides 

comprising a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target TAT 
mRNA (sense) or TAT DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the present 
invention, comprise a fragment of the coding region of TAT DNA . Such a fragment generally comprises at least 
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense 

15 oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, Stein and 
Cohen (Cancer Res. 48:2659, 1988) and van der Krol et al. (BioTechniques 6:958, 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation 
of duplexes that block transcription or translation of the target sequence by one of several means, including 

= enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. 

20 Such methods are encompassed by the present invention. The antisense oligonucleotides thus may be used to 
block expression of TAT proteins, wherein those TAT proteins may play a role in the induction of cancer in 
mammals. Antisense or sense oligonucleotides further comprise oligonucleotides having modified sugar- 
* phosphodiester backbones (or other sugar linkages, such as those described in WO 91/06629) and wherein such 
sugar linkages are resistant to endogenous nucleases. Such oligonucleotides with resistant sugar linkages are 

25 stable in vivo (i.e., capable of resisting enzymatic degradation) but retain sequence specificity to be able to bind 
to target nucleotide sequences. 

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are 
covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases 
affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, 

30 intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or 
antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target 
nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example, CaPO + -mediated DNA transfection, 
35 electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an 
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target nucleic 
acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral 



vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO 
90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
5 ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule 
or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target 
10 nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The 
sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase. 

Antisense or sense RNA or DNA molecules are generally at least about 5 nucleotides in length, 
alternatively at least about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 
28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 
15 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 
300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 
510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 
720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 
930, 940, 950, 960, 970, 980, 990, or 1000 nucleotides in length, wherein in this context the term "about" 
20 means the referenced nucleotide sequence length plus or minus 10% of that referenced length. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification 
of closely related TAT coding sequences. 

Nucleotide sequences encoding a TAT can also be used to construct hybridization probes for mapping 
the gene which encodes that TAT and for the genetic analysis of individuals with genetic disorders. The 
25 nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

When the coding sequences for TAT encode a protein which binds to another protein (example, where 
the TAT is a receptor), the TAT can be used in assays to identify the other proteins or molecules involved in 
30 the binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified. 
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors 
or agonists of the binding interaction. Also, the receptor TAT can be used to isolate correlative ligand(s). 
Screening assays can be designed to find lead compounds that mimic the biological activity of a native TAT or 
a receptor for TAT. Such screening assays will include assays amenable to high-throughput screening of 
35 chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and 
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cell based assays, which are well characterized in the art. 

Nucleic acids which encode TAT or its modified forms can also be used to generate either transgenic 
animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically 
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, 
which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic 
5 stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops. In one embodiment, cDNA encoding TAT can be used to clone genomic DNA encoding TAT in 
accordance with established techniques and the genomic sequences used to generate transgenic animals that 
contain cells which express DNA encoding TAT. Methods for generating transgenic animals, particularly 
animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patent 
10 Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for TAT transgene incorporation 
with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding TAT introduced 
into the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression 
of DNA encoding TAT. Such animals can be used as tester animals for reagents thought to confer protection 
from, for example, pathological conditions associated with its overexpression. In accordance with this facet of 
15 the invention, an animal is treated with the reagent and a reduced incidence of the pathological condition, 
Z compared to untreated animals bearing the transgene, would indicate a potential therapeutic intervention for the 
pathological condition. 

Alternatively, non-human homologues of TAT can be used to construct a TAT "knock out" animal 
which has a defective or altered gene encoding TAT as a result of homologous recombination between the 

20 endogenous gene encoding TAT and altered genomic DNA encoding TAT introduced into an embryonic stem 
cell of the animal. For example, cDNA encoding TAT can be used to clone genomic DNA encoding TAT in 
= accordance with established techniques. A portion of the genomic DNA encoding TAT can be deleted or 
replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor 
integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included 

25 in the vector [see e.g., Thomas and Capecchi, Cell . 51:503 (1987) for a description of homologous 
recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and 
cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected [see 
e.g., Li et al., Cell . 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., 
a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic Stem 

30 Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 13-152] . A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create 
a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of the animal contain the 
homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend 

35 against certain pathological conditions and for their development of pathological conditions due to absence of 
the TAT polypeptide. 
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Nucleic acid encoding the TAT polypeptides may also be used in gene therapy. In gene therapy 
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective 
genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional 
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic 
agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA. 
5 Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in 
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act 
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell 
membrane. (Zamecnik et al. , Proc. Natl. Acad. Sci. USA 83:4143-4146 [1986]). The oligonucleotides can be 
modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by 
10 uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in 
. ;=i the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro 
;p include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium phosphate 
;M precipitation method, etc. The currently preferred in vivo gene transfer techniques include transfection with viral 
Jq (typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau et al., Trends in 
;f Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with 
i f1 an agent that targets the target cells, such as an antibody specific for a cell surface membrane protein or the 
target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins which bind 
2£) to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to facilitate 
j =?= uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for proteins which 
; J: undergo internalization in cycling, proteins that target intracellular localization and enhance intracellular half-life. 
The technique of receptor-mediated endocytosis is described, for example, by Wu et al., J. Biol. Chem. 262, 
4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). For review of gene 
25 marking and gene therapy protocols see Anderson et al., Science 256, 808-813 (1992). 

The nucleic acid molecules encoding the TAT polypeptides or fragments thereof described herein are 
useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 
markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each TAT nucleic acid molecule of the present invention can be used as a chromosome marker. 
30 The TAT polypeptides and nucleic acid molecules of the present invention may also be used 

diagnostically for tissue typing, wherein the TAT polypeptides of the present invention may be differentially 
expressed in one tissue as compared to another, preferably in a diseased tissue as compared to a normal tissue 
of the same tissue type. TAT nucleic acid molecules will find use for generating probes for PCR, Northern 
analysis, Southern analysis and Western analysis. 
35 This invention encompasses methods of screening compounds to identify those that mimic the TAT 

polypeptide (agonists) or prevent the effect of the TAT polypeptide (antagonists). Screening assays for 
antagonist drug candidates are designed to identify compounds that bind or complex with the TAT polypeptides 
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encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides 
with other cellular proteins, including e.g., inhibiting the expression of TAT polypeptide from cells. Such 
screening assays will include assays amenable to high-throughput screening of chemical libraries, making them 
particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding assays, 
5 biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 

All assays for antagonists are common in that they call for contacting the drug candidate with a TAT 
polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 

10 reaction mixture. In a particular embodiment, the TAT polypeptide encoded by the gene identified herein or the 
drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of 
the TAT polypeptide and drying. Alternatively, an immobilized antibody, e.g. , a monoclonal antibody, specific 

r for the TAT polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed 

15 by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized 
Z component, e.g., the coated surface containing the anchored component. When the reaction is complete, the 
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are 
detected. When the originally non- immobilized component carries a detectable label, the detection of label 
immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 

30 component does not carry a label, complexing can be detected, for example, by using a labeled antibody 
specifically binding the immobilized complex. 

^ If the candidate compound interacts with but does not bind to a particular TAT polypeptide encoded by 

: ^ a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co- 

25 immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 
(Fields and Song, Nature (London) . 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA . 88:9578- 
9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA . 89: 5789-5793 (1991). Many 
transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting 

30 as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast 
expression system described in the foregoing publications (generally referred to as the "two-hybrid system") 
takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to 
the DNA-binding domain of GAL4, and another, in which candidate activating proteins are fused to the 
activation domain. The expression of a GALl-lacZ reporter gene under control of a GAL4-activated promoter 

35 depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies containing interacting 
polypeptides are detected with a chromogenic substrate for p-galactosidase. A complete kit 
(MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two- 
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hybrid technique is commercially available from Clontech. This system can also be extended to map protein 
domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

Compounds that interfere with the interaction of a gene encoding a TAT polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
5 containing the product of the gene and the intra- or extracellular component under conditions and for a time 
allowing for the interaction and binding of the two products. To test the ability of a candidate compound to 
inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a 
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) 
between the test compound and the intra- or extracellular component present in the mixture is monitored as 
10 described hereinabove. The formation of a complex in the control reaction(s) but not in the reaction mixture 
containing the test compound indicates that the test compound interferes with the interaction of the test compound 
and its reaction partner. 

To assay for antagonists, the TAT polypeptide may be added to a cell along with the compound to be 
- screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence 

15 of the TAT polypeptide indicates that the compound is an antagonist to the TAT polypeptide. Alternatively, 
; antagonists may be detected by combining the TAT polypeptide and a potential antagonist with membrane-bound 
; TAT polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition 
assay. The TAT polypeptide can be labeled, such as by radioactivity, such that the number of TAT polypeptide 
molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene 

20 encoding the receptor can be identified by numerous methods known to those of skill in the art, for example, 
ligand panning and FACS sorting. Coligan et al., Current Protocols in Immun. , 1(2): Chapter 5 (1991). 
Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to 
the TAT polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS 
cells or other cells that are not responsive to the TAT polypeptide. Transfected cells that are grown on glass 

25 slides are exposed to labeled TAT polypeptide. The TAT polypeptide can be labeled by a variety of means 
including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a 
single clone that encodes the putative receptor. 

30 As an alternative approach for receptor identification, labeled TAT polypeptide can be photoaffmity- 

linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is 
resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 

35 library to identify the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 
would be incubated with labeled TAT polypeptide in the presence of the candidate compound. The ability of 



the compound to enhance or block this interaction could then be measured. 

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 
immunoglobulin with TAT polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric 
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
5 Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the TAT 
polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 
TAT polypeptide. 

Another potential TAT polypeptide antagonist is an antisense RNA or DNA construct prepared using 
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of 

10 mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used 
to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes the mature TAT polypeptides herein, is used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 

£5 complementary to a region of the gene involved in transcription (triple helix - see Lee et al. , Nucl. Acids Res. . 

I 6:3073 (1979); Cooney et al., Science . 241: 456 (1988); Dervan et al., Science . 251:1360 (1991)), thereby 
preventing transcription and the production of the TAT polypeptide. The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the TAT polypeptide 
(antisense - Okano, Neurochem. . 56:560 (1991); Oligodeoxvnucleotides as Antisense Inhibitors of Gene 
20 Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered 
to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the TAT 

• polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation site, 
e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or 

25 growth factor or other relevant binding site of the TAT polypeptide, thereby blocking the normal biological 
activity of the TAT polypeptide. Examples of small molecules include, but are not limited to, small peptides 
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic 
compounds. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
30 Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by 
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by 
known techniques. For further details see, e.g. , Rossi, Current Biology . 4:469-471 (1994), and PCT publication 
No. WO 97/33551 (published September 18, 1997). 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 
35 and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it 
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
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97/33551, supra. 

These small molecules can be identified by any one or more of the screening assays discussed 
hereinabove and/or by any other screening techniques well known for those skilled in the art. 

Isolated TAT polypeptide-encoding nucleic acid can be used herein for recombinantly producing TAT 
polypeptide using techniques well known in the art and as described herein. In turn, the produced TAT 
5 polypeptides can be employed for generating anti-TAT antibodies using techniques well known in the art and 
as described herein. 

Antibodies specifically binding a TAT polypeptide identified herein, as well as other molecules 
identified by the screening assays disclosed hereinbefore, can be administered for the treatment of various 
disorders, including cancer, in the form of pharmaceutical compositions. 
10 If the TAT polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing 

antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an 
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the variable- 
- region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the target 
1-5 protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA 
£ technology. See, e.g., Marasco et al, Proc. Natl. Acad. Sci. USA . 90: 7889-7893 (1993). 

The formulation herein may also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely affect each other. 
Alternatively, or in addition, the composition may comprise an agent that enhances its function, such as, for 
20 example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are 

suitably present in combination in amounts that are effective for the purpose intended. 
=- The following examples are offered for illustrative purposes only, and are not intended to limit the scope 

^ of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by reference 
25 in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
30 throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, 
VA. 

EXAMPLE 1 : Identification of TAT Polypeptides and/or Encoding Nucleic Acids by GEPIS 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
35 was searched and interesting EST sequences were identified by GEPIS. Gene expression profiling in silico 
(GEPIS) is a bioinformatics tool developed at Genentech, Inc. that characterizes genes of interest for new cancer 
therapeutic targets. GEPIS takes advantage of large amounts of EST sequence and library information to 
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determine gene expression profiles. GEPIS is capable of determining the expression profile of a gene based upon 
its proportional correlation with the number of its occurrences in EST databases, and it works by integrating the 
LIFESEQ® EST relational database and Genentech proprietary information in a stringent and statistically 
meaningful way. In this example, GEPIS is used to identify and cross- validate novel tumor antigens, although 
GEPIS can be configured to perform either very specific analyses or broad screening tasks. For the initial 
5 screen, GEPIS is used to identify EST sequences from the LIFESEQ® database that correlate to expression in 
a particular tissue or tissues of interest (often a tumor tissue of interest). The EST sequences identified in this 
initial screen (or consensus sequences obtained from aligning multiple related and overlapping EST sequences 
obtained from the initial screen) were then subjected to a screen intended to identify the presence of at least one 
transmembrane domain in the encoded protein. Finally, GEPIS was employed to generate a complete tissue 
10 expression profile for the various sequences of interest. Using this type of screening bioinformatics, various 
TAT polypeptides (and their encoding nucleic acid molecules) were identified as being significantly 
overexpressed in a particular type of cancer or certain cancers as compared to other cancers and/or normal non- 
cancerous tissues. The rating of GEPIS hits is based upon several criteria including, for example, tissue 
specificity, tumor specificity and expression level in normal essential and/or normal proliferating tissues. The 
15 following is a list of molecules whose tissue expression profile as determined by GEPIS evidences high tissue 
■=• expression and significant upregulation of expression in a specific tumor or tumors as compared to other tumor(s) 
- and/or normal tissues and optionally relatively low expression in normal essential and/or normal proliferating 
tissues. As such, the molecules listed below are excellent polypeptide targets for the diagnosis and therapy of 
cancer in mammals. 

20 Molecule upregulation of expression in : as compared to : 

DNA59610-1556 (TAT136) ovary tumor normal ovary tissue 

^_ EXAMPLE 2 : Tissue Expression Profiling Using GeneExpress® 

A proprietary database containing gene expression information (GeneExpress®, Gene Logic Inc., 

25 Gaithersburg, MD) was analyzed in an attempt to identify polypeptides (and their encoding nucleic acids) whose 
expression is significantly upregulated in a particular tumor tissue(s) of interest as compared to other tumor(s) 
and/or normal tissues. Specifically, analysis of the GeneExpress® database was conducted using either software 
available through Gene Logic Inc., Gaithersburg, MD, for use with the GeneExpress® database or with 
proprietary software written and developed at Genentech, Inc. for use with the GeneExpress® database. The 

30 rating of positive hits in the analysis is based upon several criteria including, for example, tissue specificity, 
tumor specificity and expression level in normal essential and/or normal proliferating tissues. The following is 
a list of molecules whose tissue expression profile as determined from an analysis of the GeneExpress® database 
evidences high tissue expression and significant upregulation of expression in a specific tumor or tumors as 
compared to other tumor(s) and/or normal tissues and optionally relatively low expression in normal essential 

35 and/or normal proliferating tissues. As such, the molecules listed below are excellent polypeptide targets for 
the diagnosis and therapy of cancer in mammals. 



100 



Molecule upregulation of expression in : as compared to : 



normal tissues tumor tissues 

DNA73730-1679 colon tumor colon uterus 

(TAT155) uterus bladder 

bladder heart 

5 heart cervix 

cervix liver 

liver pancreas 

pancreas prostate 

prostate esophagus 

10 esophagus small intestine 

small intestine testis 

testis muscle 

muscle spleen 
spleen 

15 

DNA59610-1556 ovary tumor uterus colon 

(TAT 136) stomach stomach 

spleen spleen 

skin skin 

20 rectum rectum 

prostate prostate 

^_ pancreas pancreas 

_ ovary endocrine 

nervous system nervous system 

25 lung lung 

liver liver 

kidney kidney 
endocrine 
colon 



3€ breast 

~ EXAMPLE 3 : Microarrav Analysis to Detect Upregulation of TAT Polypeptides in Cancerous Tumors 

Nucleic acid microarrays, often containing thousands of gene sequences, are useful for identifying 
differentially expressed genes in diseased tissues as compared to their normal counterparts. Using nucleic acid 

35 microarrays, test and control mRNA samples from test and control tissue samples are reverse transcribed and 
labeled to generate cDNA probes. The cDNA probes are then hybridized to an array of nucleic acids 
immobilized on a solid support. The array is configured such that the sequence and position of each member of 
the array is known. For example, a selection of genes known to be expressed in certain disease states may be 
arrayed on a solid support. Hybridization of a labeled probe with a particular array member indicates that the 

40 sample from which the probe was derived expresses that gene. If the hybridization signal of a probe from a test 
(disease tissue) sample is greater than hybridization signal of a probe from a control (normal tissue) sample, the 
gene or genes overexpressed in the disease tissue are identified. The implication of this result is that an 
overexpressed protein in a diseased tissue is useful not only as a diagnostic marker for the presence of the disease 
condition, but also as a therapeutic target for treatment of the disease condition. 

45 The methodology of hybridization of nucleic acids and microarray technology is well known in the art. 

In one example, the specific preparation of nucleic acids for hybridization and probes, slides, and hybridization 
conditions are all detailed in PCT Patent Application Serial No. PCT/US01/10482, filed on March 30, 2001 and 
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which is herein incorporated by reference. 

In the present example, cancerous tumors derived from various human tissues were studied for 
upregulated gene expression relative to cancerous tumors from different tissue types and/or non-cancerous human 
tissues in an attempt to identify those polypeptides which are overexpressed in a particular cancerous tumor(s). 
In certain experiments, cancerous human tumor tissue and non-cancerous human tumor tissue of the same tissue 
type (often from the same patient) were obtained and analyzed for TAT polypeptide expression. Additionally, 
cancerous human tumor tissue from any of a variety of different human tumors was obtained and compared to 
a "universal" epithelial control sample which was prepared by pooling non-cancerous human tissues of epithelial 
origin, including liver, kidney, and lung. mRNA isolated from the pooled tissues represents a mixture of 
expressed gene products from these different tissues. Microarray hybridization experiments using the pooled 
control samples generated a linear plot in a 2-color analysis. The slope of the line generated in a 2-color analysis 
was then used to normalize the ratios of (testxontrol detection) within each experiment. The normalized ratios 
from various experiments were then compared and used to identify clustering of gene expression. Thus, the 
pooled "universal control" sample not only allowed effective relative gene expression determinations in a simple 
2-sample comparison, it also allowed multi-sample comparisons across several experiments. 

In the present experiments, nucleic acid probes derived from the herein described TAT polypeptide- 
encoding nucleic acid sequences were used in the creation of the microarray and RN A from various tumor tissues 
were used for the hybridization thereto. Below is shown the results of these experiments, demonstrating that 
various TAT polypeptides of the present invention are significantly overexpressed in various human tumor tissues 
as compared to other human tumor tissues and/or non-cancerous human tissue(s). As described above, these 
data demonstrate that the TAT polypeptides of the present invention are useful not only as diagnostic markers 
for the presence of one or more cancerous tumors, but also serve as therapeutic targets for the treatment of those 
tumors. 

Molecule upregulation of expression in : as compared to : 

DNA59610-1556 (TAT136) ovary tumor normal ovary tissue 

epithelial control 

EXAMPLE 4 : Quantitative Analysis of TAT mRNA Expression in Prostate Tumors 

Previously it has been reported that human beneign prostatic hyperplysia (BPH) epithelial cells, 
immortalized via a large T antigen oncogene, fail to form tumors even though they survive for several months 
after grafted in immunodeficient mice. However, these cells can become tumorigenic if they are co-cultured with 
tumor-associated stromal cells prior to grafting . The conversion of the non-tumori genie cells to tumorigenic cells 
elicited by the stromal cells derived from prostate tumors appears to be permanent because when the malignant 
cells isolated from these primary tumors are grafted again in the absence of tumor-associated stromal cells, 
tumors are formed. The tumorigenic and non-tumorigenic BPH cells provide a convenient model system for gene 
profile analysis during prostate tumorigenesis. One advantage of the BPH cells is their homogeneity as compared 
to the primary prostate tumor tissue that is often heterogeneous with mixed cell types and various degrees of 
tumor grades. 
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In the present experiments, we have used RNAs extracted from 2 sublines of the tumorigenic and the 
parental, non-tumorigenic BPH cells (named BPH1CAFTD and BPH1, thereafter) and examined expression 
profiles of certain genes in these cells. As described below, we found that the expression of certain genes is 
upregulated in tumorigenic BPH cells compared to non-tumorigenic BPH cells. 

Parental BPH1 and BPH1CAFTD cells were maintained in RPMI1640 plus 5% fetal bovine serum 
5 (Gibco-BRL) in 5 % C0 2 humidified tissue culture incubator. Cells were grown to 90% confluency in 100 mm 
tissue culture dishes (Falcon). Total RNAs were isolated using RNeasy columns (Qiagen), treated with DNase 
I (Roche Molecular (BMB)) for 40 minutes at room temperature, and cleaned up by using RNeasy columns 
(Qiagen). Oligonucleotide microarray experiments were performed and analyzed as previously described (Jin 
et al., Genentech, Inc.). Briefly, after being tested for quality on test arrays, the samples were hybridized for 
10 16 hours at 45 °C to Human Genome arrays (U95A, B, C, D, E). The arrays were washed, then stained with 
streptavidin-phycoerythrin (Genome arrays were amplified with an anti-streptavidin antibody). The arrays were 
scanned with the Gene Array scanner (Agilent Technologies). Raw data were collected and analyzed using 
Affymetrix Microarray Suite and Data Mining Tools software. Experiments were done in six replicates for two 
= BPH1 samples. Mann- Whitney pair-wised comparison was performed to identify genes that were differentially 
15 expressed. Each of the six parental BPH1 samples, was compared to BPH1CAFTD samples resulting in 36 
= pairwise comparisons for both sample comparisons. Genes with the concordance exceeded 80.6% were 
considered as statistically significant (p<0.05). Gene lists from both comparisons were then loaded into 
GeneSpring software (Silicon Genetics, Redwood City CA) to look for common Affymetrix probe sets in both 
lists. This resulted in 493 probe sets increased in common to both lists and 398 in common that decreased 
30 expression relative to BPH1CAFTD. The expression of these genes was then looked at by examining Human 
Prostate Adenocarcinomas and Nodular Hyperplasias compared to Normal Prostates in silico using the 
GeneLogic GeneExpress 2000 database. 
2 The following is a list of molecules whose tissue expression profile as determined by the above analysis 

evidences high tissue expression and significant upregulation of expression in prostate tumor as compared to 
25 normal prostate tissue. As such, the molecules listed below are excellent polypeptide targets for the diagnosis 
and therapy of prostate cancer in mammals. 

Molecule upregulation of expression in : as compared to : 

DNA49143-1429 (TAT154) prostate tumor normal prostate tissue 

DNA77631-2537 (TAT156) prostate tumor normal prostate tissue 

30 

EXAMPLE 5 : Use of TAT as a hybridization probe 

The following method describes use of a nucleotide sequence encoding TAT as a hybridization probe 
for, i.e., diagnosis of the presence of a tumor in a mammal. 

DNA comprising the coding sequence of full-length or mature TAT as disclosed herein can also be 
35 employed as a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of 
TAT) in human tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled TAT-derived probe to the filters is performed in a 



solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH 
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed 
in an aqueous solution of O.lx SSC and 0.1% SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence TAT can 
then be identified using standard techniques known in the art. 

5 

EXAMPLE 6 : Expression of TAT in E. coli 

This example illustrates preparation of an unglycosylated form of TAT by recombinant expression in 

E. coli. 

The DNA sequence encoding TAT is initially amplified using selected PCR primers. The primers 
10 should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from E. coli; see Bolivar et al., Gene . 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 
?- amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
15 for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 
* sequence, and enterokinase cleavage site), the TAT coding region, lambda transcriptional terminator, and an 
argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described in 
Sambrook et al. , supra . Transformants are identified by their ability to grow on LB plates and antibiotic resistant 
20 colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
" antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 
25 After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 

pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized 
TAT protein can then be purified using a metal chelating column under conditions that allow tight binding of the 
protein. 

TAT may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA 
30 encoding TAT is initially amplified using selected PCR primers. The primers will contain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then 
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 
35 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB containing 50 mg/ml 
carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into 
CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium citrate»2H20, 1.07 g KC1, 5.36 g Difco 



yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MPOS, pH 7.3, 0.55% (w/v) 
glucose and 7 mM MgS0 4 ) and grown for approximately 20-30 hours at 30 °C with shaking. Samples are 
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell 
pellets are frozen until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 
5 guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0. 1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results 
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 
rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 

10 clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm 
using the calculated extinction coefficient based on its amino acid sequence. 

1 5 The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 

of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. 
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition 
ofTFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the 

20 solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
refolded protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0.1% 
TFA with elution with a gradient of acetonitrile from 10 to 80% . Aliquots of fractions with A280 absorbance 
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 

25 since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin 
from the samples. 

Fractions containing the desired folded TAT polypeptide are pooled and the acetonitrile removed using 
30 a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 1 4 M sodium chloride and 4 % mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 

Certain of the TAT polypeptides disclosed herein have been successfully expressed and purified using 
this technique(s). 

35 
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EXAMPLE 7 : Expression of TAT in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of TAT by recombinant 
expression in mammalian cells. 

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 
Optionally, the TAT DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the TAT 
5 DNA using ligation methods such as described in Sambrook et al. , supra . The resulting vector is called pRK5- 
TAT. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 
grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 fig pRK5-TAT DNA is mixed with about 1 fig 

10 DNA encoding the VA RNA gene [Thimmappaya et al. , Cell, 31 :543 (1982)] and dissolved in 500 fd of 1 mM 
Tris-HCl, 0. 1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 500 fil of 50 mM HEPES (pH 
7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 
precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 

15 then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections , the culture medium is removed and replaced with culture 
medium (alone) or culture medium containing 200 j^Ci/ml 35 S-cysteine and 200 /iCi/ml 35 S-methionine. After 
=. a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15 % 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 

26 presence of TAT polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 
':; In an alternative technique, TAT may be introduced into 293 cells transiently using the dextran sulfate 

method described by Somparyrac et al., Proc. Natl. Acad. ScL . 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 fig pRK5-TAT DNA is added. The cells are first concentrated from 

25 the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 /u.g/ml bovine insulin and 
0. 1 /xg/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove 
cells and debris. The sample containing expressed TAT can then be concentrated and purified by any selected 

30 method, such as dialysis and/or column chromatography. 

In another embodiment, TAT can be expressed in CHO cells. The pRK5-TAT can be transfected into 
CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell cultures can 
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 
35 S-methionine. After determining the presence of TAT polypeptide, the culture medium may be replaced with 

35 serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium 
is harvested. The medium containing the expressed TAT can then be concentrated and purified by any selected 
method. 
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Epitope-tagged TAT may also be expressed in host CHO cells. The TAT may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
his tag into a Baculovirus expression vector. The poly-his tagged TAT insert can then be subcloned into a SV40 
driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
5 above, to verify expression. The culture medium containing the expressed poly-His tagged TAT can then be 
concentrated and purified by any selected method, such as by Ni 2+ -chelate affinity chromatography. 

TAT may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO 
cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
10 as an IgG construct (irnmunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 
and CH2 domains and/or is a poly-His tagged form. 

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 
~ standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology . Unit 3.16, John 
15 Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' 
of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells 
j is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 
promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
i expression permits selection for stable maintenance of the plasmid following transfection. 

26 Twelve micrograms of the desired plasmid DNA is introduced into approximately 1 0 million CHO cells 

using commercially available transfection reagents Superfect* (Quiagen), Dosper® or Fugene* (Boehringer 
~ Mannheim). The cells are grown as described in Lucas et al., supra . Approximately 3 x 10 7 cells are frozen 
in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
25 vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
fj.m filtered PS20 with 5% 0.2 /j,m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 
spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL 
30 and 2000 mL spinners are seeded with 3 x 10 s cells/mL. The cell media is exchanged with fresh media by 
centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, 
a production medium described in U.S. Patent No. 5, 122,469, issued June 16, 1992 may actually be used. A 
3L production spinner is seeded at 1.2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the 
35 temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 
polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 



70% , the cell culture is harvested by centrifugation and filtering through a 0.22 filter. The filtrate was either 
stored at 4°C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 
purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl 
5 and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0. 14 M NaCl and 4% 
mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
10 conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 
ml fractions into tubes containing 275 yuL of 1 M Tris buffer, pH 9. The highly purified protein is subsequently 
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed 
15 by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation. 

Certain of the TAT polypeptides disclosed herein have been successfully expressed and purified using 
this technique(s). 

EXAMPLE 8 : Expression of TAT in Yeast 
20 The following method describes recombinant expression of TAT in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of TAT from 
the ADH2/GAPDH promoter. DNA encoding TAT and the promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of TAT. For secretion, DNA encoding TAT can 
be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native TAT 
25 signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory 
signal/leader sequence, and linker sequences (if needed) for expression of TAT. 

Yeast cells, such as yeast strain AB 1 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by 
precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
30 Coomassie Blue stain. 

Recombinant TAT can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing TAT may further be purified using selected column chromatography resins. 

Certain of the TAT polypeptides disclosed herein have been successfully expressed and purified using 
35 this technique(s). 
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EXAMPLE 9 : Expression of TAT in Baculovirus-Infected Insect Cells 

The following method describes recombinant expression of TAT in Baculovirus-infected insect cells. 
The sequence coding for TAT is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 
5 as pVL1393 (Novagen). Briefly, the sequence encoding TAT or the desired portion of the coding sequence of 
TAT such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 
10 Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus 

DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 171 1) using lipofectin (commercially 
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used 
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et 
;Q al., Baculovirus expression vectors: A Laboratory Manual . Oxford: Oxford University Press (1994). 
f-f Expressed poly-his tagged TAT can then be purified, for example, by Ni 2+ -chelate affinity 

^ chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
: : 4 Rupert et al., Nature . 362: 175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0. 1 mM EDTA; 10% glycerol; 0. 1 % NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 
|§i> in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 /urn 
\ =?= filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 
4- mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A 2g0 with loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 
25 phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
A 280 baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His I0 -tagged TAT are 
pooled and dialyzed against loading buffer. 
30 Alternatively, purification of the IgG tagged (or Fc tagged) TAT can be performed using known 

chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Certain of the TAT polypeptides disclosed herein have been successfully expressed and purified using 
this technique(s). 

35 EXAMPLE 10 : Preparation of Antibodies that Bind TAT 

This example illustrates preparation of monoclonal antibodies which can specifically bind TAT. 
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Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra . Immunogens that may be employed include purified TAT, fusion proteins containing 
TAT, and cells expressing recombinant TAT on the cell surface. Selection of the immunogen can be made by 
the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the TAT immunogen emulsified in complete Freund's 
5 adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice 
10 by retro-orbital bleeding for testing in ELISA assays to detect anti-TAT antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
with a final intravenous injection of TAT. Three to four days later, the mice are sacrificed and the spleen cells 
are harvested. The spleen cells are then fused (using 35 % polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
15 which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against TAT. Determination of 
"positive" hybridoma cells secreting the desired monoclonal antibodies against TAT is within the skill in the art. 
The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
20 ascites containing the anti-TAT monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
* accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography . Alternatively , 
affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

Antibodies directed against certain of the TAT polypeptides disclosed herein have been successfully 
25 produced using this technique(s). 

EXAMPLE 1 1 : Purification of TAT Polypeptides Using Specific Antibodies 

Native or recombinant TAT polypeptides may be purified by a variety of standard techniques in the art 

of protein purification. For example, pro-TAT polypeptide, mature TAT polypeptide, or pre-TAT polypeptide 
30 is purified by immunoaffinity chromatography using antibodies specific for the TAT polypeptide of interest. 

In general, an immunoaffinity column is constructed by covalently coupling the anti-TAT polypeptide antibody 

to an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 

sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). 
35 Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 

chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 

chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
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is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immunoaffinity column is utilized in the purification of TAT polypeptide by preparing a fraction 
from cells containing TAT polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by 
other methods well known in the art. Alternatively, soluble TAT polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble TAT polypeptide-containing preparation is passed over the immunoaffinity column, and the 
column is washed under conditions that allow the preferential absorbance of TAT polypeptide {e.g. , high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
antibody/TAT polypeptide binding {e.g. , a low pH buffer such as approximately pH 2-3, or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and TAT polypeptide is collected. 
Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209, USA (ATCC): 



Material 

DNA49143-1429 
DNA73730-1679 
DNA7763 1-2537 
DNA59610-1556 



ATCC Dep. No. 

203013 
203320 
203651 
209990 



Deposit Date 

June 23, 1998 
October 6, 1998 
February 9, 1999 
June 16, 1998 



These deposits were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 
between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
USC § 122 and the Commissioner's rules pursuant thereto (including 37 CFR § 1.14 with particular reference 
to 886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 
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The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and fall within the scope of the appended claims. 
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